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Abstract 

The transmission of digital data is one of the principal tasks in modern 
wireless communication. Classically, the communication channel consists 
of one transmitter and one receiver; however, due to the constantly in¬ 
creasing demand in higher transmission rates, the popularity of using 
several receivers and transmitters has been rapidly growing. 

In this paper, we combine a number of fairly standard techniques from 
numerical linear algebra and probability to develop several (apparently 
novel) randomized schemes for the decoding of digital messages sent over 
a noisy multivariate Gaussian channel. 

We use a popular mathematical model for such channels to illustrate 
the performance of our schemes via numerical experiments. 

Keywords: MIMO, decoding schemes, noisy Gaussian channels 
Math subject classification: 65C99, 90C27, 94A12, 94B35 

1 Introduction 

The importance of wireless communication can hardly be overestimated; our life 
nowadays is unimaginable without it. For obvious reasons, the transmission of 
digital (as opposed to analog) data is one of the principal tasks in wireless com¬ 
munication. In the case when there is one transmitter and one receiver, the sub¬ 
ject has been extensively studied for decades. More recently, however, as more 
and more data are collected and need to be rapidly transmitted, there has been 
an increasing demand in ways to boost communication performance; as a result, 
much research is being done on the use of multiple receivers/transmitters. This 
topic (generally referred to as multiple-input and multiple-output, or MIMO) is 
now at the frontier of the research in modern communications (see e.g. 

0 , m, mi, m, m, m, m- 

The transmission of digital data over a MIMO channel is typically described 
by the following model (see e.g. [IT], [H]). Suppose that m > 0 is an integer 
power of 2, and that C is a collection of m points in the complex plane (C is 
usually referred to as a ’’constellation”); each element of C corresponds to a 
unique binary word of length log 2 (m). 

Suppose also that there are n transmitters and n receivers, and that X = C m 
is the collection of all possible vectors in C m whose coordinates belong to C 
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(obviously, X contains m n vectors). Suppose, in addition, that a > 0 is a real 
number, and that H is an n by n complex matrix (the ’’channel matrix”). For 
any x in X, we define the random n— dimensional complex vector y(x) via the 
formula 


y(x) = H ■ x +a ■ (zi+i ■ z 2 ), (1) 

where z\,z 2 are independent standard normal random vectors, and i = \J— 1. 
Here x represents the transmitted message (that encodes n • log 2 (m) binary 
bits), H ■ x represents the received message in the absence of noise, and y(x) 
represents the received message corrupted by Gaussian noise of component-wise 
standard deviation a. 

In this model, the decoding problem can be formulated as follows: suppose 
that Xtrue is the transmitted message, and that y is the received message (i.e. 
the observed value of y(a,’true))- Under the assumption that H and a are known, 
one needs to find x^est in X such that 

\\H ■ Xbest - y\\ < \\H ■ x - y\\, (2) 

for any x in X. In other words, Xbest is the maximum likelihood estimate of 

*£true* 

In modern applications, any practical decoding algorithm must be quite 
rapid. For instance, the peak speed requirement for 4G mobile telecommu¬ 
nication technology is 1 Gigabit per second, and the 100 Gb/s RF Backbone 
DARPA project aims at the rate of 100 Gigabits per second (using optical wire¬ 
less communication). To complicate the matters, the channel matrix H needs 
to be frequently reevaluated (several times per second). 

In principle, the decoding problem can be solved by brute force, i.e. by 
iteratively testing all x in X. However, X contains m" vectors, which is a large 
number even for small values of m and n (e.g. m = n = 8); this makes the 
brute force approach impractical. 

Another approach is based on the observation that, due to (t!Hll . H ■ Zbest is 
the nearest neighbor of y in the collection 

HX = {H- x : x€X} (3) 

of m n complex vectors in C n ; thus, a fast nearest neighbors algorithm might be 
used to find Zbest- Such algorithms typically pre-process HX to obtain auxiliary 
data structures; then, a single y can be decoded reasonably fast (see e.g. fill '). 
However, in a typical modern application HX contains between 10 7 and 10 ln 
vectors, which makes the initial pre-processing of HX unaffordable in terms of 
both memory requirements and CPU time (see e.g. USD, especially considering 
that this calculation has to be redone every time that H changes. In other 
words, even the fastest generic nearest neighbor search in HX might not be 
fast enough. 

Yet another approach is based on the observation that, as opposed to HX, 
the collection X does not depend on H; moreover, it has a special structure 
that allows for fast nearest neighbor searches in X. Thus, one can compute x 
via the formula 


x = H 1 • y, 


(4) 


2 


and look for Xbest among several nearest neighbors of x within X. Unfortunately, 
this way the baby gets often thrown out with the bath water: Xbest will typically 
not be among any reasonable number of nearest neighbors of x unless a is 
significantly smaller than the minimal singular value X n of H. Suppose, for 
example, that n = 8, that C the so-called 8-PSK constellation of size m = 8 
(equispaced points on a unit circle), that H is a randomly generated complex 
Gaussian matrix, and that a = 0.75 (i.e. roughly twice the expected minimal 
singular value of such a matrix). Then, Xbest will coincide with x t rue i n about 
half of all cases; yet, in more than 70% of these cases Xbest will not be among 
even as many as 500 nearest neighbors of x in X. 

Some other decoding schemes and approaches can be found, for example, in 
m , nn, D3> na, m (see also references therein). 

In this project, we develop several randomized decoding schemes. Our 
schemes are based on the observation that, in typical applications, the chan¬ 
nel matrix H is not large (e.g. n = 8 or n = 16), and thus one can afford 
to evaluate its singular value decomposition (SVD). Then, we use the SVD of 
H to find £best by a rapid randomized test-and-trial procedure. Our tentative 
algorithms demonstrate reasonably good performance in several test cases (see 
Section [5]), even when a = 0(X n ). The schemes break down only when a is so 
large that the estimate Xbest defined via (El differs from the transmitted mes¬ 
sage a.’t rue with high probability (e.g. it is impossible to recover x t rue without 
additional assumptions). 

This paper is organized as follows. Section [2] contains mathematical pre¬ 
liminaries to be used in the rest of the paper. Section [3] contains a number of 
related theoretical results. In Section H we describe the decoding problem and 
present several decoding schemes. In Section [5j we illustrate the performance 
of our decoding schemes via numerical experiments. In Section [6l we present 
some conclusions and outline possible directions of future research. 

2 Mathematical Preliminaries 

In this section, we introduce notation and summarize several facts to be used 
in the rest of the paper. 

2.1 Probability 

In this section, we summarize some well known facts from the probability theory. 
These facts can be found, for example, in Q’ a, a, a- 

Suppose that x > 0 is a real number. In agreement with the standard 
practice, we define the gamma function by the formula 



(5) 


The one-dimensional standard Gaussian distribution N( 0,1) with mean zero 
and standard deviation one is defined by its probability density function (pdf) 



—oo < t < oo. 


( 6 ) 
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Its cumulative distribution function (cdf) is given by the formula 


® (I) = L e ^' 2 dl =1 ' ^ ' erfc (w) • (7) 

where erfc : R —> K. is the complementary error function defined via the formula 

erfc(i) = - 7 = 

V 71 " 



for all real x. 

Suppose that d > 0 is a positive integer. We say that the random vec¬ 
tor X has standard Gaussian d-dimensional distribution N(0d,Id), if all of its 
coordinates are independent standard Gaussian random variables. 

Suppose now that X ~ N(0d,Id)- Then ||X || 2 has distribution x 2 (d) with 

pdf 


j.d/2-1 . e -t/2 

fx 2 (d){t)= 2d/2 r(rf/2 ) ’ i>0 ’ 

where T denotes the gamma function defined via © above. 
n > 0 is a positive integer, then 


(9) 


In particular, if 


/ X 2 (2n)(i) = 


t n ~ 1 • e"*/ 2 


2" • (n- 1)!’ 

for all t > 0 , and the corresponding cdf F x 2^n) admits the form 

n—1 


( 10 ) 


F. 


X 2 (2' 


,)(*) = 1 -e X/2 'Y,W. 


fe =0 


x 


( 11 ) 


for all x > 0 . 

The real-valued Gaussian random variable has a straightforward generaliza¬ 
tion to the complex plane. Suppose that X, Y ~ N(0,1) are i.i.d. standard 
normal variables. We say that the random variable Z defined via the formula 

Z = X + i ■ Y (12) 

is a standard normal complex variable, and we denote its distribution by iVC(0, 1)0 
Similarly, if d > 0 is a positive integer, we say that the complex d —dimensional 
random vector Zd has complex standard distribution NClfidj Id) if all of its co¬ 
ordinates are i.i.d. standard normal complex variables. Needless to say, in this 
case 


II^H 1 2 ~X 2 (2-d). 


(13) 


The following well known theorem states the fact that Gaussian distribution 
is invariant under orthogonal transformations. 


1 Chi-square with d degrees of freedom. 

- in some sources, the same distribution might be denoted ;YC(0. 2) or ;YC(0, 2; 0). 
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Theorem 1. Suppose that d > 0 is a positive integer, that U is the d by d 
unitary matrix, and that the complex d— dimensional random vector Z d has dis¬ 
tribution NC(0d, Id)- Then, 

U-Z d ~ NC(O d ,I d ). (14) 


The following theorem describes some additional properties of complex nor¬ 
mal random variables. 

Theorem 2. Suppose that n > 0 is a positive integer, and that ai,...,a n 
are complex numbers. Suppose also that z\,...,z n are i.i.d. standard normal 
complex random variables. Then, 

a\- z\-\ -b a n ■ z n ~ NC(0, 1) • i/|ai| 2 -b |a n | 2 . (15) 


2.2 Linear Algebra 

The following widely known theorem can be found, in a more general form, in 
most standard books on linear algebra (see, for example, [9], [IS]). 

Theorem 3. Suppose that n > 0 is a positive integer, and that H is an n by n 

complex matrix. Then, there exist non-negative real numbers ay > • • • > a n > 0 

and n by n unitary matrices U and V such that 

H = U - Yi-V*, (16) 

where V* denotes the conjugate transpose of V, and £ is the diagonal n by n 
matrix whose diagonal entries are defined via the formula 

T^i,i — &i, ( 17 ) 

for every i = l,...,n. The columns ui,...,u„ of U are referred to as left- 
singular vectors, the columns ty,..., v n of V are referred to as right-singular 
vectors, and ay are called the singular values of H. The factorization 

(fTH is typically referred to as the singular value decomposition (SVD) of H. 

3 Analytical Apparatus 

The purpose of this section is to provide the analytical apparatus to be used in 
the rest of the paper. 

Theorem 4. Suppose that n > 0 is a positive integer, that H is an n by n 
complex regular matrix, and that the matrices U , £, V constitute the SVD of H, 
as in (1161) of Theorem 0 Suppose, in addition, that w ~ NC(O d , I d ) is the 
standard normal complex random vector in C d . Then, 

H~ 1 ■ w ~ — • Vi H-b — ■ v n , (18) 

where the real numbers ay,..., a n are the singular values of H, the n— dimensional 
complex vectors v\,... ,v n are the columns of V, and z\, ..., z n are i.i.d. stan¬ 
dard complex normal random variables, as in m- 
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Proof. Due to (fTBl) in Theorem [3] 

H~ l = V- E " 1 • U*, (19) 

where E” 1 is the diagonal matrix whose diagonal entries are erf 1 ,..., erf 1 . We 
combine (fill) in Theorem [T] with (TTHl) to obtain (1181) . ■ 

Corollary 1. Suppose, in addition to the hypothesis of Theorem [^J that 1 < 
i < n is an integer. Then, 


{H 1 ■ w) (i) 


z i 


N 


n— 1 

E 

k =1 


M*)l s 


+ Z n 


M»)l 


( 20 ) 


where, for any complex vector v in C n , we denote by v(i) its ith coordinate. 
Proof. We combine Theorem [2] with (1191) to obtain (121)1) . ■ 


4 Numerical Algorithms 

In this section, we describe several numerical algorithms for the decoding of 
a digital signal sent over a noisy MIMO channel, in the sense described in 
Section P below. 

4.1 Principal Decoding Task 

In this section, we provide a formal description of the principal decoding task 
used in the rest of this paper (see e.g. [2], [IT] . [12] . ill]. [II]). 

Suppose that n > 0 and m > 0 are integers, that 

C = {Cl, , C m } (21) 

is a collection of to points in the complex plane (often referred to as ’’constella¬ 
tion” in the literature), that H is an n by n complex matrix, and that a > 0 is a 
real number. Suppose also that X is the collection of to" vectors in C" defined 
via the formula 

X = {(x(l),...,x{n)) T : x(l),... ,x{n) € c| , (22) 

and that x t rue is a vector in X (we think of Zti-ue as of the unknown trans¬ 

mitted message). Suppose also that, for every x in X, the random complex 
n— dimensional vector y(x) is defined via the formula 

y{x) = H ■ x + cr ■ Z n , (23) 

where Z n ~ N<C(0d, Id) is the standard normal d —dimensional complex random 
vector (see Section [2d] above). 

Task. Suppose that itrue in X is the transmitted message, and that y 0 b s in 
C" is the observed value of y{x t rU e)- Given X, H, a and y 0 bs) one needs to find 
Xtrue- More specifically, one needs to find Xbest in X such that 

• £best ^obs| \\H * "Z ^/obsll) (24) 

for any x in X. In other words, Xbest is the maximum likelihood estimate of 

*£true* 
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Remark 1. Obviously, for any fixed H and any Xt.me in X, the probability 
that Xbest defined via (HD is equal to Xt.rue depends on cr (more specifically, this 
probability decreases as a increases). In particular, for noise of sufficiently large 
coordinate-wise standard deviation a, the best likelihood estimate is unlikely to 
coincide with the transmitted message. 

In the view of Remark [TJ we make the following observations. 

1. If Xbest defined via (I2H) coincides with xtruej a numerical scheme for 
decoding y D b s should be able to recover Xtrue (the task is well defined). 

2. If Xbest defined via (1240 is different from Xtrue, then there is not enough 
information for recovering Xtrue (without additional assumptions). In that case, 
it might still be desirable for a decoding numerical scheme to recover Xbest- 

4.2 Running Example 

For the sake of concreteness, we introduce the following example (by specifying 
typical values of parameters of the model from Section ITT1) . to be used in the 
rest of this paper. 

1. The dimensionality n of the channel matrix (the number of transmitters 
and receivers): 


n = 6,7, 8 . (25) 

While the value n = 8 seems to be typical in many application, the schemes 
should handle values up to n = 16 or even n = 32. 

2. The number m of constellation points: 

m = 8 . (26) 

While constellations of size m = 8 are frequently used in applications, many 
popular constellations contain m = 16 or m = 32 points, and these values 
should be kept in mind. 

3. The constellation C: 

C = je*' 7r ' fe / 4 : 0 < k < 8} (27) 

(the so-called 8 -PSK constellation). In other words, C consists of m = 8 points 
equally distributed on the unit circle. 

4. The channel matrix H: in agreement with a common practice, we assume 
that the n 2 entries of H have been independently drawn from the standard 
normal complex distribution iVC(0,1). In other words, 

~ NC(0, 1), (28) 


for every i, j = 1 ,..., n. 

In the rest of this section, we describe several decoding schemes. Some of 
them are well known; others are apparently novel. All of these schemes receive 
the observed value y 0 b s of y(xt rue ) as an input and look for Xbest defined via 

EH . 
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4.3 Brute force algorithm 

The brute force algorithm locates Xbest by searching through all of X. In other 
words, it consists of the following steps: 

Step 1. For every x in X, evaluate the real number d(x) via the formula 

d(x) = \\H ■ x — j/ 0 bs|| ■ (29) 

Step 2. Define Xbest via finding the minimum among all d{x ), i.e. 

d(x hest ) < d(x), (30) 

for all x in X. Obviously, (l3U|) is equivalent to (1241) . 

Memory requirements. While the collection X is typically quite large (it 
contains to" vectors - see urn all the vectors in X can be iteratively computed 
one by one in a straightforward way, thus obviating the need to pre-compute 
X and store it in memory (as far as the brute force algorithm is concerned). 
Hence, the memory requirements of the brute force algorithm are minimal in 
the sense that it requires only 

Mbrute = O (n 2 + m ■ n) (31) 

memory words. In other words, practically speaking, the brute force algorithm 
does not require any memory to speak of (as to, n are typically very small: see 
Section 14.21) . 

Cost. The cost of the brute force algorithm is 

Cbrute = o (|X| • n 2 ) = O (to" • n 2 ) (32) 


operations. 

Success rate. The brute force algorithm always locates .Tb es t and, in this 
sense, is optimal. 

Remark 2. Even for as small values ofm and n asm = n = 8 (see Section \4Jfy , 
the brute force algorithm requires about 8 10 ss 10 9 operations. Since in a typical 
application one needs to decode thousands or even millions messages per second, 
its high cost deems the brute force decoding algorithm impractical. 

Conclusion. While the brute force algorithm is useless in practical appli¬ 
cations, it is a reasonable (albeit slow) testing tool, due to its 100% success 
rate. 

We conclude by summarizing the principal input and output parameters of 
the brute force algorithm described above. 

Calling sequence: 


brute(y 0 b s ; *best)- 


(33) 


Input parameters: 

- y ohs in C": 

the received message (a noisy observation of y(xtrue) defined via (I251l l. 

Output parameters: 

— *£best In X. 

the maximal likelihood estimate of Xtrue (see (1^41) 1. 


4.4 Nearest Neighbors in H ■ X 

Obviously, the vector H ■ :Ebest (see E41) ) is simply the nearest neighbor of y 0 b s 
within the collection HX of vectors in C” defined via the formula 


HX = {H x : xeX}. (34) 

This observation suggests using a fast nearest neighbors algorithm for comput¬ 
ing irbestj such as, for example, the randomized approximate nearest neighbors 
algorithm (RANN) described in [15], [T6] (needless to say, the straightforward 
nearest neighbors search is simply the algorithm from Section POD . Obviously, 
a nearest neighbor search in C” is equivalent to that in R 2 ". 

RANN consists of two steps briefly described below (the reader is referred 
to m for a more detailed description, and to [3] for another nearest neighbors 
algorithm): 

Preprocessing (depends on H but not on y 0 hs)- One constructs a 
tree-like structure on HX, recursively subdividing the points in HX along each 
of 2 n randomly chosen real axes. The resulting ’’tree of boxes” consists of 2 2 ” 
boxes, each containing (to/ 4)” points on average. Typically, several random 
subdivisions are constructed (one can also further refine each tree by subdividing 
boxes that contain too many points). 

Query for nearest neighbors of y 0 bs- Once the auxiliary tree of boxes 
on HX has been constructed, the search for nearest neighbors of any given y a b s 
is done as follows. First, one locates the box that y Q b s belongs to; then, the 
points in this box (and, possibly, in several boxes nearby) are inspected one by 
one. 

Success rate. The success rate depends on such parameters as the number 
of trees used, their internal structure (e.g. the number of subdivisions in each 
tree, average number of points in a box), etc. In particular, there is a trade-off 
between probability of locating the nearest neighbor of y 0 b s and required CPU 
time. However, in this environment one can typically achieve success rate of 90% 
or higher without increasing CPU time too much; to that end, the parameters 
of the nearest neighbor search are best tuned empirically. 

Memory requirements. Storing the data structures constructed in the 
pre-processing step requires 


M preprocess = O (to”) (35) 

memory words per a single tree. In particular, even for such small values of 
to and n as to = n = 8 (see Section 14.211 , the memory requirements can easily 
exceed several Gigabytes. 

Cost. The cost of a single query for the nearest neighbor of y 0 b s is propor¬ 
tional to the depth of the tree and the number of points in a box; in particular, 
it will be of the order 


Cquery = O [u 2 • log 2 (m)) (36) 

(see e.g. HU). In other words, running time of a single query is likely to be 
reasonable (compare to (1331) . and see also Section fOl) . However, the cost of the 
pre-processing step will be at least 

Cp r e P roces = O (to”) (37) 
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operations. While this step does not need to be redone for each new y 0 bs, one 
still need to pre-process the data from scratch every time the channel matrix H 
changes. In particular, in any application where H is frequently re-evaluated, 
this creates an additional major obstacle to using such algorithms. 

Conclusion. We strongly suspect that any generic nearest neighbors algo¬ 
rithm (used as a black box) will be impractical for this decoding problem, in 
terms of either memory requirements or CPU time (or both), simply due to the 
sheer amount of points in HX. If, in addition, the channel matrix H changes 
frequently, such approaches are likely to be completely unaffordable. 

4.5 Nearest Neighbors in X 

In Section l4~4l above, we discussed nearest neighbors searches within the collec¬ 
tion HX defined via ([34]) . In comparison, for any vector y G C n and integer 
1 < k < n, to find k nearest neighbors of y within X is a much easier task, for 
the following reasons: 

1. Due to (1^1) . for any x in X, 


II V - *H 2 = 1 2 /( 1 ) - *(1)| 2 + •'' + \y{n) x(n )| 2 , 


(38) 


where :r(l),..., x{n) are in C. Thus, the search for nearest neighbors is reduced 
to n one-dimensional searches. For example, to find the first nearest neighbor 
of y in X one simply needs to minimize \y(j) — x(j) | separately for every j = 
1, • • • ,n. 

2. The collection X enjoys various symmetries that can be employed in 
nearest neighbor searches. For example, if C is defined via m, and one has 
found the first k nearest neighbors of 


x 0 = (1,...,1) T 


(39) 


within X , then, for any x in X and every 1 < j < k, the jth nearest neighbor 
of x within X is computed from that of xq by coordinate-wise rotation (in n 
operations). In other words, if Xj is the j th nearest neighbor of xq, then the jth 
nearest neighbor Xj of x is defined via the formula 


Xj = (xj( 1) • x(l),.. . ,Xj(n ) • x{n)). 


(40) 


3. As opposed to HX, the collection X obviously does not depend on H, 
and thus any pre-computation on X needs to be done only once and for all. 
These observations suggest the following algorithm for decoding y 0 b s : 

Step 1 . Select a positive integer 1 < k < m n . 

Step 2. Compute the vector i 0 b s in C n via the formula 



(41) 


Step 3. Find the k nearest neighbors X\, ..., Xk of s 0 b s within X. 
Step 4. For every j = 1,..., k, evaluate dj via the formula 


dj — ||H • Xj 2/obs|| ) 


( 42 ) 


and find the index jo that corresponds to the minimal dj. 
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Step 5. Return x nn defined via the formula 

•^nn(fc) "Do (43) 

(obviously, the subscript k refers to the number of nearest neighbors used to 
evaluate x nn ^))- 

The vector x nn computed by this algorithm is an approximation to Zbest 
defined via (l24l) . 

We summarize the principal input and output parameters of the algorithm 
described in this section as follows. 

Calling sequence: 

nnx(y obs ;fc;Xobs,Znn(fc))- (44) 

Input parameters: 

- y 0 bs in c n : 

the received message (a noisy observation of y(x trU e) dehned via E51) l. 

- k (a positive integer): 

the number of nearest neighbors to be used. 

Output parameters: 

- fobs in X: 

the "naive” candidate for cc tr ue (see (ED)- 

- *^nn(fc) ^ X. 

the candidate for Xtrue found by nearest neighbors search in X (see (BSD). 

Memory requirements. The memory requirements for this algorithm are 
minimal; one might only need to store H and the list of k nearest neighbors of 
a; 0 bs in A', which requires 

M nn = 0(k ■ n + n 2 ) (45) 

memory words. Compared, for example, to (BSD, this is essentially negligible. 

Cost. It costs 0(n 3 ) operations to invert H\ however, if, for example, 
we have evaluated the SVD of H beforehand, the evaluation of Jobs via (ED 
costs only 0(n 2 ) operations. The evaluation of x\, . .. ,Xk will typically require 
0(n-k) operations, and, for every j, the evaluation of dj via (1421) requires 0(n 2 ) 
operations; thus, the total cost of the algorithm is 

C nn = 0(k■ n 2 ) (46) 

operations. 

Success rate. Unfortunately, unless H is close to an orthogonal matrix or a 
in K51) is small compared to the smallest singular value of H (see Theorem 0) , 
the success rate of this scheme will typically be poor (loosely speaking, due 
to Theorem 0] - in other words, H~ l magnifies the noise by different factors 
in different directions). In the case of the model described in Section l4~2l the 
success rate of this scheme is investigated empirically in Section [S] below. 

4.6 Randomized Decoder 1 (RaDel) 

In this section, we described a randomized decoding scheme based on Theorem[4] 
and Corollary Q] This scheme will be referred to as "Randomized Decoder 1”, 
or "RaDel”. 
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4.6.1 Preliminary discussion 


Suppose that I/, H,V constitute the SVD of H (see Theorem [3]). Suppose also 
that the random vector y(x t rue ) is defined via (pi) . and that the random vector 
x in C n is defined via the formula 

X = H~ 1 -y(x true)- (47) 

Due to the combination of (lT7l) with Theorems [3j U 

n 

x ~ Xtrue = y ~]z k - V k , (48) 

,, Vk 
k =1 

where vi ,..., v n are the columns of V, o\,... ,a n are the singular values of H , 
and Z\,.. . , z n are i.i.d. standard complex normal variables. 

Observation 1. The identity C51) admits the following interpretation: 
while the distribution of y(x true) is radially symmetric about H ■ Strue, the 
distribution of x is not. More specifically, the ’’noise” in x has the largest vari¬ 
ance in the direction of v n and the smallest variance in the direction of v±. In 
other words, z n has larger effect on how far x is from Xtrue that z i does. 

Suppose now that, for every j = 1 ,,n and every k = 1,... ,n, we define 
the real numbers s k (j) and S k (j) via the formulae 

S,W = hiffli, (49) 

Vk 


SkU) 


N 




1=1 


(50) 


respectively. It follows from the combination of (H^l) . (15(71) and Corollary [I] that 

x{j) ~ X t rue{j) + z- a - S n -i(j) + Z n ■ <7 ■ S n (j), (51) 

for every j = 1,..., n, where z, z n are i.i.d. iVC(0,1) (obviously, x(l),..., x(n) 
are not independent of each other). 

The coefficients S n -i(j) and s n (j) in (15H are determined by H. In the 
case when the entries of H have been drawn independently from iVC(0,1) (see 
Section 14.21) , we can make the following observation. 

Observation 2 . Suppose that 6 < n < 32 is an integer, that the entries of 
H are i.i.d. NC(0, 1), and that the real random variable r = r(H ) is defined 
via the formula 

r(H) = (52) 

Sn- lU) 

where, for any H, 1 < j < n is the integer such that S n -i(j) < S n -i(i) for 
every i = 1,..., n. Then, 


0.25 <E[5„_r(j)] <0.3, (53) 

and also 


4.5 < E [r(H )] < 6. (54) 

In other words, the coefficient in front of z n in m is about 5 times larger (on 
average) than that in front of z. 
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4.6.2 RaDel: informal description 


The discussion in Section 14.6.11 above leads to the following decoding scheme 
(based primarily on (l5ll) and Observation 2 above). 

We select 1 < j < n such that S n -i(j) in (l5ll) is minimal. Then, for every 
* = 1 ,..., m, we assume that 


Ztrue(j) = Ci, (55) 

and proceed as follows. Under the assumption (l55l) . the difference x(j)—Xt Iue (j) 
is a sum of two independent complex normal variables with different variances 
(see HID). We sample one of them; more specifically, we draw the complex 
number 5 from the distribution iVC(0,1). The assumptions (l55l) and 2 = 5 
determine the value of z n in (Hull . However, we observe that z n is ” responsible” 
for the largest component of the noise in x, due to (l48l) . Thus, we can reduce 
the noise by subtracting z n - a ■ a~ x ■ v n from the observed value of x. Next, we 
look for the nearest neighbor Xi of the ’’improved” x in X (a simple task; see 
Section BHD • We observe that, provided that the assumption (l55l) is correct and 
the noise in x has been reduced, Xi is more likely to be equal to x t rue than if we 
were to use the nearest neighbor search in X for the original x. 

To validate the assumption (15511 . we make several observations. 

1. For each i = 1 the assumption (1551 yields Xi . We can simply 

select the best x * out of Xi,... ,x m (the one for which \\H ■ Xi — y 0 bs|| is the 
smallest). 

2. If Xi = ®true 5 then we can determine the noise from (1231 : in particular, 
we can compute the square of the Euclidean norm of the noise. However, this 
quantity must be an observed value of a % 2 ( 2 n) random variable (multiplied by 
cr 2 ), whose cdf is defined via (ED; we use it to estimate our ’’confidence” in the 
statement that Xi = Zti-ue- 

3. The value of z n must be an observed value of a standard normal complex 
variable. In particular, \z n \ 2 ~ y 2 (2), and we can reject the assumption (l55l) if 
| 2„| 2 is too large. 

Finally, we observe that the scheme described above is randomized. The ap¬ 
parent downside is that, even when the assumption (1551 is correct for a particular 
value of i, there is a finite probability that Xi ^ ccbest (see (ED)- However, the 
obvious advantage of randomization is that the scheme can be iterated (and the 
probability of failure will decrease with the number of iterations; see, however, 
Remark B1 in Section l5~H) . 

4.6.3 Basic RaDel: detailed description 

This section contains a detailed description of the decoding scheme described 
in Section 11. 6 . 21 above. 

Precomputation. Suppose that H is the n by n channel matrix. 

Step 1 . Evaluate the SVD (e.g. the matrices U, E, V") of H (see TheoremBD- 

Step 2. For every k = 1,... ,n, evaluate s n (k) and S n -i{k) via (l49l) . (l50l) . 
respectively. 

Step 3. Find 1 < j < n such that S n -i(j ) < S n -i(k) for every k = l,...,n. 
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Decoding. Suppose that x t rue in X is the (unknown) transmitted message, 
that a > 0 is the component-wise variation of the noise in (1231) . and that y Q b s 
is the observed value of y{x true ) (see (1251) 1. 


For every i = 1,..., m proceed as follows: 

Step 1 . Sample the complex number z ~ NC( 0,1). 
Step 2. Evaluate i 0 bs in C" via m- 
Step 3. Evaluate z n via the formula 

x 0 bs(j) - - z ■ a ■ S n -i(j) 

Zn — ( 

Vn(j) 

where v n in C n is the nth column of V. 

Step 4. Evaluate x, in C n via the formula 


Xi — .X'ob.s Zn • V n . 

Step 5. Find the nearest neighbor x,; of X{ in X. 

Step 6. Evaluate Wi in C n via the formula 

Wi = y 0 b s - H ■ x^ 

Step 7. Evaluate the real numbers r,; and \i y i a fl ie formulae 

|wi(l )| 2 H-f \wi(n )\ 2 

r i = -o- 


and 


(56) 


(57) 


(58) 


(59) 


Xi = 1 ~ F x 2 (2n)(ri), (60) 

respectively, where F x 2 ( 2n ) is the cdf of the yin-distribution (see (fTTl) l. 

Thus, for each i = 1 the scheme produces x, £ X and the real 

numbers rj and \i- Suppose that io is the index of the smallest Xj (and thus 
largest \i) among ri,...,r„. The algorithm returns the vector XR a Dei in X 
defined via the formula 


XRaDei — Xi 0 . (^^) 

Observation. Under the assumption that Xt rU e = Xj, the vector iv t de¬ 
fined via (1581) is an observed value of the random vector having the distribution 
a ■ NC(0d,Id) (see (12511 1. Consequently, n is an observed value from the y | n 
distribution, and \i is the probability that a y 2 n random variable attains values 
larger than r t (we refer to as the ’’confidence”). 

As a conclusion, we summarize the principal input and output parameters 
of the decoding scheme described in this section. 

Calling sequence: 

RaDel_search (y ohs - x RaD ei, r, y). (62) 

Input parameters: 
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- y ohs in C n : 

the received message (a noisy observation of y(a;true) defined via (l23l) h 

Output parameters: 

- ajRaDei in X: 

the candidate for the transmitted message (see (ED)- 

- r (a positive real number): 

the squared norm of the noise (provided that Ztrue = ^RaDei), see (15U1) . 

- x (a real number between 0 and 1): 
the confidence in XR, a Dei (see (l60l) 'l. 

4.6.4 RaDel: full algorithm 

In this section, we describe a decoding algorithm whose basic step is the scheme 
described above (see Sections 14.6.21 14.6.41) . 

Additional parameters. 

- min_RaDel: a positive integer (minimal number of iterations) 

- max_RaDel > min_RaDel: a positive integer (maximal number of itera¬ 
tions) 

- Xthresh- a real number between 0 and 1 (a confidence threshold) 

Description. 

1. For each 1 < j < min_RaDel, call RaDel .search (y 0 bs', Xj, rj, Xj)- 

2. Among all (xj,rj,Xj), select the triplet 


(•TRaDel i ^UaDel ? XRaDel ) — i i‘t'i •> Xi) (63) 

that corresponds to the smallest rj. 

3. For each min_RaDel < j < max_RaDel: 

3a. call RaDel_search (y Q b s ; , rj, Xj). 

3b. if rj < r RaDe i, set 

(SR aD el,rRaDel,XRaDel) = (Xj,Tj,Xj)- (64) 


if XRaDel ^ Xthreshj Stop. 


Comment. The algorithm conducts at least min_RaDel iterations of the 
basic scheme. Then, if the confidence in the best guess is high enough (compared 
to Xthresh), the algorithm stops. Otherwise, the basic scheme is called iteratively 
until the confidence is high enough or the total number of iterations reaches 
max_RaDel. 

To conclude, we summarize the principal input and output parameters of 
the algorithm described in this section. 

Calling sequence: 


RaDel_all (y obs ; min_RaDel, maxJtaDel, Xthresh! ZRaDei , r Ra Dei, XRaDel) ■ 

(65) 


Input parameters: 

- Vohs in C ra (see Section 14.6.31) . 

- min_RaDel: see above. 

- max_RaDel: see above. 
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o < Xthresh < 1: see above. 


Output parameters: 

- ZRaDei in X (see ©). 

- rR a Dei (a positive real number): see (163 D . 

- XRaDei (a real number between 0 and 1): see (l63ll . 


4.6.5 RaDel: cost and memory requirements 

In this section, we describe the cost and memory requirements of the basic 
decoding scheme described in Sections 14.6.2114.6.31 above. 

Memory requirements. The memory requirements of the scheme are or 
the order 


M RaDe i = 0 (m-n + ri 2 ) ( 66 ) 

memory words - in other words, absolutely minimal (compare for example to 
(1551) : see also Section IT2l for typical values of m and n). 

Cost. The precomputation step (see Section 14. 6 . 3D requires 0(n 3 ) opera¬ 
tions, and should not be re-done until H changes. 

On the other hand, each decoding step requires 

CRaDei = 0(m- n 2 ) (67) 

operations. Therefore, the full algorithm described in Section 14.6.41 requires 
between 


CRaDei,best = O (min_RaDel ■ m ■ n 2 ) 


and 


^RaDel,worst — ^ ^maX_RaDGl * 7TI • Tl ^ 


( 68 ) 


(69) 


operations. 

Success rate. Obviously, the success rate of RaDel depends on various 
parameters of the problems (e.g. H,cr,X) as well as on the parameters of the 
algorithm (see Section I4.6.4D . In addition, since the scheme is randomized, for 
each set of conditions, there is a certain probability of failure (that decreases 
with the number of iterations of the basic step). Obviously, there is a trade-off 
between the probability of success and number of iterations. 

The performance of the scheme is demonstrated via several experiments in 
Section [5] 

4.7 Randomized Decoder 2 (RaDe2) 

In this section, we describe yet another decoding scheme. This scheme is closely 
related to RaDel from Section ITTH and can be viewed as a generalization of the 
latter. This scheme will be referred to as ’’Randomized Decoder 2”, or ”RaDe2”. 
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4.7.1 Preliminary discussion 

The decoding scheme RaDel from Section 14.61 is based on the equation (15X1) . 
Loosely speaking, one fixes the coordinate j, ’’guesses” the value of x true (j). 
samples z from iVC(0,1), and determines the value of z n from (15X1) . Then, the 
noise in x is reduced by subtracting its (estimated) largest component in the 
direction of v n (see (1X51) 1. 

In this section, we describe an algorithm that uses two (rather than one) co¬ 
ordinates. The resulting scheme is more accurate than RaDel (see Section HTH) : 
however, it is also more computationally expensive. 

Along the lines of Section l4.fi. 11 we make the following observation. Suppose 
that 1 < j-\. j 2 < n correspond to the two smallest values of S' rl _ 2 (l),..., S n - 2 {n) 
defined via (loUl) . Due to the combination of (1X51) . (1XX)1) . (15U1) . 

x(jl) ~ ^true(il) + U-CT ■ S n - 2 (j l) + Z n - 1 • (7 • S„_i(ji) + z n ■ a ■ S n (j i), (70) 
x{j 2 ) ~ .Ttrue(j2) + W-CT ■ S n - 2 {j 2 ) + Z n -i • (7 • S n -i(j 2 ) + Z n ■ <T ■ S n (j 2 ), (71) 

where u,w ~ iVC(0,1) (however, u and w are not independent of each other), 
while z n -i,z n ~ ./VC(0,1) are independent of each other and of u,w. 

Obviously, due to (l50l) . the coefficients of u,w in CD are even smaller than 
the coefficient of z in m (see Section l4~6l in particular Observation 2). 

4.7.2 RaDe2: informal description 

The discussion in Section 14.7.11 above leads to the following decoding scheme 
(somewhat similar to that from Section 14.61) . 

We select 1 < ji,j 2 < n such that S n - 2 (ji), S n - 2 (j 2 ), respectively, are 
the smallest among S n - 2 (1 ),..., S n - 2 (n). Then, for every pair of indices 1 < 
ii,i 2 < m, we assume that 


•Etrue(jl) — Cii , (X2) 

^true(j2) ^ Ci 2 , (73) 

and proceed as follows. Under the assumption 113, the difference x(j) — xt IU e(j) 
for each j = ji,j 2 is a sum of three independent complex normal variables 
with different variances (see ([7X1) 1. We sample u,w in (17X1) by drawing i.i.d. 
z\,... , z n - 2 ~ 1VC(0,1) and using (PXH1) . Then, we determine the values of 
z n - 1 , z n in ED under the assumption ED and u = u, w = w by solving the 
corresponding two by two linear system. We reduce the noise in x by subtracting 
z n -1 • a ■ a~_ i • v n -\ and z n ■ a ■ ■ v n from the observed value of x (see (PX51) b 

Next, we look for the nearest neighbor x^^ 2 of the ’’improved” x in A' (see 
Section 1X31) . We observe that, provided that the assumption ED is correct and 
also the noise in x has indeed been reduced, x n >7 ; 2 is more likely to be equal to 
cctrue than if we were to use the nearest neighbor search in X for the original x. 

To validate the assumption (l55l) . we make several observations. 

1. For each 1 < ii,i 2 < m, the assumption ED yields ,,; 2 . We can simply 
select the best one out of the m 2 possible Xi lt i 2 (actually, we do not even need 
to examine all m 2 possibilities; see Section Pi. 7.61 below!. 

2. If Xi lt i 2 = Xtruej then we can determine the noise from (l24l) : in particular, 
we can compute the square of the Euclidean norm of the noise. However, this 
quantity must be an observed value of a y 2 (2n) random variable (multiplied by 
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er 2 ), whose cdf is defined via CD; we use it to estimate our ’’confidence” in the 
statement that Xi lt i 2 = Xtrue- 

3. The value of |z n _i | 2 + \z n \ 2 must be an observed value of a y 2 (4) random 
variable, and thus we can reject the assumption (ESI before evaluating Xi lt i a if 
\z n \ 2 is too large. 


4.7.3 Basic RaDe2: detailed description 

This section contains a detailed description of the decoding scheme described 
in Section 14.7.21 above. 

Compared to the scheme described in Section 14.6.31 the procedure below ac¬ 
cepts an additional input parameter: namely, the ’’early exit confidence” % s top 
(a real number between 0 and 1 : see (1501) below). 

Precomputation. Suppose that H is the n by n channel matrix. 

Step 1 . Evaluate the SVD (e.g. the matrices U, E, V ) of H (see Theorem[SJ. 

Step 2. For every k = l,...,n, evaluate s n (k),s n -i(k) and S n - 2 (k) via 
(Ml, AMD, respectively. 

Step 3. Find 1 < j \, j 2 < n such that S n - 2 (ji ), S n - 2 (ji) are the two small¬ 
est values among S'„_ 2 (l), ..., S n - 2 {ri). 

Decoding. Suppose that Xt Iue in X is the (unknown) transmitted message, 
that cr > 0 is the component-wise variation of the noise in (1231) . and that y 0 b s 
is the observed value of y(x true ) (see A2S|). 

For every i\ = 1 ,,m and *2 = 1,..., m, proceed as follows: 


Step 1 . Sample the n— 2 complex numbers if,..., Zn —2 independently from 
7VC(0,1). 

Step 2. Evaluate u, w via the formulae 

n —2 

u = 'S^'z k - - V k {ji), ( 74 ) 

£i ak 

n—2 

w = y^z k - v k (j 2 ), (75) 

ti ak 

where v k in C n is the fcth column of V for every k = 1 ,..., n. 

Step 3. Evaluate x 0 b s in C" via (ED- 

Step 4. Evaluate z n _i, z n via solving the two by two linear system 


a-v n -i{ji) 

■ + z n ■ 

■ v n (j 1 ) 

Zn— 1 

®n— 1 

& n 

CT-Vn-! (j 2 ) 

■ + z n ■ 

a ■ Vnfa) 

Zn— 1 

®n— 1 

® n 


.^obs(jl) (-ii &■> 

^obsO'2) - Cj 2 - W, 


in the unknowns z n -i,z n . 

Step 5. Evaluate the real numbers r and \ via the formulae 

r = \z n -i\ 2 + \z n \ 2 , 

X= l ~ F x i w( r )’ 


(76) 

(77) 
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respectively, where F x 2 ( 4 ) is the cdf of the distribution y 2 (4) defined via the 
formula ED- 

Step 6 . If 


X < Xstop? 


(80) 


skip the rest of the steps for these values of i\, 22 . 

Comment. Roughly speaking, if \ is too small, it means that 1 and z n 
are unlikely to be observed values of two i.i.d. NC(0, 1) random variables; in 
other words, the assumption is likely to be wrong for these values of i±, * 2 - 
Step 7. Evaluate Xi lt i 2 in C” via the formula 

* a --v n (81) 


— *^obs Z n —1 


On— 1 


’ Vn— 1 Zn 


Step 8. Find the nearest neighbor xi lt i 2 of Xi lt i 2 in X. 

Step 9. Evaluate Wi lt i 2 in C n via the formula 

,?2 J/obs F[ • Xj^ _ ,; 2 . 

Step 10. Evaluate the real numbers r u j 2 and via the formulae 


fU’i 


! (l)| 2 + --- + |tcii, i2 (n)| 5 


(82) 


(83) 


and 


Xii,i 2 1 F x 2(2n) i r 11 , 12)1 (84) 

respectively, where F x 2 ( 2 „) is the cdf of the x| n -distribution (see (fill) ). 


Thus, for every pair ( 21 , 12 ) for which Steps 7-10 were performed, the scheme 
produces Xi lt i 2 £ X and the real numbers ri lt i 2 and x*i ,® 2 - Suppose that, among 
these, the triplet 

(3'RaDe2i 2'RaDe2? XRaDe2) “ (*^ii ,i 2 3 ,12 3 X*i ,i 2 ) (85) 

corresponds to the smallest r ,;, >7 ; 2 (equivalently, to the largest Xii,i 2 )- 

As a conclusion, we summarize the principal input and output parameters 
of the decoding scheme described in this section. 

Calling sequence: 

RaDe2jsearch (y obs ; Xstop; ^RaDe 2 , r, x) ■ (86) 

Input parameters: 

- y ohs in C”: 

the received message (a noisy observation of y(a;true) defined via (T^TJ|) 1. 

- Xstop (a real number between 0 and 1): 

the confidence threshold used in Step 6 (see (15U1) 1. 


Output parameters: 

“ £RaDe2 iu X: 

the candidate for the transmitted message (see (1531) 1. 

- r (a positive real number): 

the squared norm of the noise (provided that Xtrue = £RaDe 2 ), see (l83l) . 

- x (a real number between 0 and 1 ): 
the confidence in SR, a De 2 (see (1511) 1. 
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4.7.4 RaDe2: full algorithm 


In this section, we describe a decoding algorithm whose basic step is the scheme 
described above (see Sections 14.7.21 14.7.31) . This algorithm is closely related 
to RaDel (see Section 14.6.41) . except that the basic step of the latter (see Sec¬ 
tion I4.6.3D is replaced with the one from Section 14.7.31 

Additional parameters. 

min_RaDe2: a positive integer (minimal number of iterations) 

- max_RaDe2 > min_RaDe2: a positive integer (maximal number of itera¬ 
tion) 

- Xthresh ■ a real number between 0 and 1 (a confidence threshold) 

- Xsto P : a real number between 0 and 1 (the early exit confidence; see (1501) 1. 

Description. 

1. For each 1 < j < min_RaDe2, call RaDe2_search (y 0 bs', X'stop! Xj, Tj,Xj)- 

2. Among all (a 'j,rj,Xj), select the triplet 

(*TRaDe2, rR a De2, XRaDe2) = {x j, Tj, \i) (87) 


that corresponds to the smallest r;. 

3. For each min_RaDe2 < j < max_RaDe2: 
3a. call RaDe2_search (y a b s ; Xsto P ; 

3b. if r :j < r RaDe 2 , set 


(^RaDe2, ?’RaDe2, XRaDe2) — (Xj , Vj , Xj)- 


( 88 ) 


3 C. if X'RaDe2 > Xthresh, Stop. 

Comment. The algorithm conducts at least min_RaDe2 iterations of the 
basic scheme. Then, if the confidence in the best guess is high enough (compared 
to Xthresh) j the algorithm stops. Otherwise, the basic scheme is called iteratively 
until the confidence is high enough or the total number of iterations reaches 
max_RaDe2. 

To conclude, we summarize the principal input and output parameters of 
the algorithm described in this section. 

Calling sequence: 


RaDe2_all (y ohs ; min_RaDe2, max_RaDe2, Xthresh, Xsto P ; £RaDe2, m a De2, XRaDe2) • 

(89) 


Input parameters: 

- y 0 bs in C” (see Section 14.7.31) . 

- min_RaDe2: see above. 

- max_RaDe2: see above. 

o < Xthresh <1: see above. 

0 < Xsto P < 1: the confidence threshold used in (1501) . 

Output parameters: 

- ZRaDe2 in X (see dHZD)- 

- ^RaDe 2 (a positive real number): see (1571) . 

- XRaDe 2 (a real number between 0 and 1): see (1571) . 
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4.7.5 RaDe2: cost and memory requirements 

In this section, we describe the cost and memory requirements of the basic 
scheme described in Sections 14.7.21 l477~3l above. 

Memory requirements. The memory requirements of the scheme are or 
the order 


M naDe2 = O (m 2 ■ n + n 2 ) (90) 

memory words - in other words, absolutely minimal (compare for example to 
(1551) ; see also Section 14.21 for typical values of m and n ). 

Cost. The precomputation step (see Section 14. 6. 3D requires 0(n 3 ) opera¬ 
tions, and should not be re-done until H changes. 

On the other hand, each decoding step requires 

C Ra ,r>e 2 = O (a ■ m 2 ■ n 2 ) (91) 

operations, where 0 < a < 1 is the proportion of pairs (ii, * 2 ) for which Steps 
7-10 in Section 14.7.31 are performed (e.g. for which y defined via (179D is greater 
than Xstop)- Therefore, the full algorithm described in Section 14.7.41 requires 
between 


CnaDe 2 . best — O (min_RaDe2 * tyi • n ) 


and 


C*RaDe2,worst — O (nifl.X_HaDe2 • Ul * Tl j 


(92) 


(93) 


operations. 

Observation. By comparing (1511) to (167D in Section 14.6.51 we observe that 
the basic step of RaDe2 (see Section 14. 7. 3D is typically somewhat slower than 
the basic step of RaDel (see Section 14. 6. 3D roughly by a factor of a ■ m, where 
0 < a < 1 is a real number (see (l9lT) above). Obviously, a depends, among 
other things, on the parameter Xstop (see Section 14. 7. 3D . 

Success rate. Obviously, the success rate of RaDe2 depends on various 
parameters of the problems (e.g. H,cr,X) as well as on the parameters of the 
algorithm (see Section I4.7.4D . In addition, since the scheme is randomized, for 
each set of conditions, there is a certain probability of failure (that decreases 
with the number of iterations of the basic step; see also Remark[5]in Section l575ll . 
Obviously, there is a trade-off between the probability of success and number 
of iterations. In addition, compared to RaDel from Section f4. 6. 41 RaDe2 is, in 
general, slower, but has a higher success rate (see Section [5] below). 

Also, in Section[5]we demonstrate the performance of the scheme via several 
experiments. 

4.8 Supercharging 

In this section, we describe a procedure that should not be used on its own, but 
rather as an additional step after the algorithms from previous sections, with 
the goal to improve their output at a moderate computational cost. 
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4.8.1 Supercharging: informal description 

Either of the algorithms RaDel, RaDe2 (see Sections 14.6114.71) computes a ’’can¬ 
didate” x for Xbest defined via J2S; in addition, it evaluates the confidence \ i n 
this candidate (see (El), ®). This candidate can be improved (e.g. replaced 
by a better candidate) by a procedure that we call ” supercharging”. 

The idea behind supercharging is based on the observation that even if x is 
not equal to .Tb es t, the latter can still be among several nearest neighbors of x 
in A. In other words, we fix the integer k\ of nearest neighbors that we wish 
to inspect and find Aq nearest neighbors aq,..., x^ of x in X (see Section l4~5ll . 
Among these, we find the one that minimizes the distance between H ■ Xi and 

2/obs- 


4.8.2 Supercharging: detailed description 

Suppose that y Q b s in C n is the observed value of a; tr ue (see Section ITT1) . that x in 
A is a candidate for Sbest evaluated by either RaDel or RaDe2 (see Sections I2~51 
S3): that r is the (normalized) squared norm of the noise, and that \ is the 
confidence in x (see ©, (EzD). Suppose also that k\ > 0 is the number of 
nearest neighbors of x that we want to inspect. Supercharging consists of the 
following steps. 

Step 1. Evaluate the Aq nearest neighbors aq ,... ,x^ of a; in A (see Sec¬ 
tion 14.511 . 

Step 2. For every i = 1,..., ki, evaluate the real number r, via the formula 

n = (94) 

Step 3. Find the minimum r t among rq,..., r jq ■ 

Step 4. If fj < r, evaluate x sup er in A and the real numbers r supel , Xsuper 
via the formulae 


*Esuper — 

^super — 

Xsuper = 1 i ? x a (2n)(ri), 

where F x 2( 2n ) is the cdf of the x 2 (^ n ) distribution defined via (fill) . 

As a conclusion, we summarize the principal input and output parameters 
of supercharging. 

Calling sequence: 

Super (^/obs: k\, X , r, a: SU per: r'super: Xsuper) • (^^) 

Input parameters: 

- y 0 bs in C": 

the received message (a noisy observation of y(x true ) defined via (125)1 h 

- k\ (a positive integer): 

the number of nearest neighbors to use. 

- x in A: 

the candidate for Xbest (see (l62ll . (IMll b 

- r (a positive real number): 

the normalized squared norm of the noise (see (1521) . (1551) 1. 


(95) 

(96) 

(97) 
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- x ( a real number between 0 and 1): 
the confidence in x (see (l62l) . (IM1) '). 

Output parameters: 

- ^super in A : 

the improved candidate (see (I551P . 

- rsuper (a positive real number): 

the corresponding normalized squared norm of the noise (see (19611 1. 

- Xsuper (a real number between 0 and 1): 
the confidence in a; sup er (see (I^TTl) 1. 


4.8.3 Supercharging: cost and memory requirements 

The memory requirements and number of operations of supercharging are given, 
respectively, by the formulae (H5l) . (HU in Section IT5l ( obviously, with k being 
replaced by fci). 

However, often this cost can be reduced roughly by the factor of n as follows. 
Suppose, for the sake of concreteness, that the constellation C is given via (EH) 
in Section EU Then, every x in X has 2 • n nearest neighbors x±,..., X 2 n in X 
all of which are at exactly the same distance from x. Each of x \,..., X 2 n differs 
from x at precisely one coordinate; in other words, for every i = 1 ,..., 2n, the 
difference x — Xi has only one non-zero coordinate. In particular, each of r, 
defined via (1M1) can be evaluated in 0(n) rather than 0{n 2 ) operations. In 
other words, if, for example, 


hi = 2 ■ n, (99) 

then the total cost of supercharging is 

C su per(2 • n) = 0(n 2 ) (100) 

(rather than 0(n 3 )) operations. 


5 Numerical Results 

In this section, we illustrate the performance of the schemes from Section [4] 
via several numerical experiments. All the calculations were implemented in 
FORTRAN (the Lahey 95 LINUX version), and were carried out in double 
precision, on a standard laptop computer with DualCore CPU 2.53 GHz and 
2.9GB RAM. 

5.1 Experiments: basic structure 

All the experiments are build around the model described in Section 14.11 In 
addition, we choose our parameters based on the running example from Sec¬ 
tion 021 

Each experiment consists of five stages described below. 
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5.1.1 Stage 1: preparation 

On this stage, we perform operations that do not depend on the channel matrix 
H, let alone the transmitted (or received) message. 

We start by selecting the number of transmitters and receivers n (see 
the constellation size m (see (125)0 and the constellation C (see (12710 . In partic¬ 
ular, the constellation C is the same in all experiments, as is the collection X 
of all possible messages (see m)- 

Next, we select the integer k > 0 and find the list of k nearest neighbors of 
xq (see ®) in A' by brute force. For any x in X, this list allows us to compute 
its k nearest neighbors in O(k-n) operations (see Sect ion fOl in particular (TTO1P . 

5.1.2 Stage 2: H and related quantities 

On this stage, we form the n by n complex matrix H and carry out some 
computations that depend only on H. 

First, we generate H according to (1251) (i.e. by drawing n 2 independent 
samples from IVC(0,1) and using them as entries of H). 

Then, we compute the singular value decomposition of H (e.g. the ma¬ 
trices U,T,,V in the notation of Theorem a. We use S and V to evaluate 
the real numbers 5 rl _2(l), • • •, S n - 2 {n), S n -i(l ),..., S n -i(n) (see (1501) 1 and 
s„_i(l),..., s„_i(n), s„(l),..., s„(n) (see (l49lP . Also, we find the integer 
1 < j < n that corresponds to the minimal S n -i(j) among S' rl _i(l),..., S n -i(n) 
(see the precomputation step in Section l4.6.3l) . In addition, we find the integers 
1 < ji 5 J2 < n such that S n - 2 (ji ) and S n - 2 (j 2 ) are the two smallest values 
among £>,-2(1), ..., S n - 2 (n) (see the precomputation step in Section [~4.7.3ll . 

Finally, we select the real number a > 0 (roughly of the same order of mag¬ 
nitude that the smallest singular value a n of H : see Theorem a and Remark [T])- 
In Table |T] we summarize the input and output parameters of all computa¬ 
tions described in Sections 15.1.1115.1.21 

5.1.3 Stage 3: generation of messages 

During this stage, we choose, more or less arbitrarily, the integer L > 0, and 
generate L points Xtrue.i, • • ■, £true,L in X by taking L independent samples from 
the uniform distribution on X. (In the language of the model from Section [4. II 
each Xtrue,i represents a message to be transmitted). 

For each i = 1,..., L, we generate y 0 bs,i from itrue.i according to (G3D as 
follows: we sample a complex normal standard random vector Zi in C n (i.e. 
from the distribution NC(0 n , I n )) and evaluate y 0 bs,i via the formula 

2/obs,i = H • Xtrue,i T & ‘ (101) 

where a is that from Table [T) Needless to say, all Zi s are sampled indepen¬ 
dently of each other. In the language of the model from Section l4Tl each y D b s ,i 
represents a received message corrupted by Gaussian noise of coordinate-wise 
standard deviation a. 

Our ultimate goal is to decode each received message ytruey by computing 
the maximal likelihood estimate Xbesty to Xt rU e,i, hr the sense of (1241) . We do so 
by means of several numerical schemes from Section [|] Some of these schemes 
have a finite probability of failure (e.g. they compute a candidate for irbesty 
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Variable 

Details 

n 

dimensionality of received/transmitted message (1251) 

m 

size of constellation (EtH) 

c 

the complex constellation (1271) 

X 

the collection of all possible messages (l 22 l) 

k 

the number of nearest neighbors 

X \, . . . , X]$ 

the nearest neighbors of xq (1M1) in X 

H 

the n by n complex channel matrix (1281) 

U,v,v 

the SVD of H (see Theorem [3]) 

Vl, ...,v n 

the columns of V (vectors in C n ) 

(Jl, . . . , (J n 

the singular values of H (positive real numbers) 

S n _i(l), , S n —i(^Tlj 

noise deviations, see (1451) 

■ 5 ^n(^) 

noise deviations, see (1451) 

S' n _ 2 (l),...,S'„_ 2 (n) 

noise deviations, see (l5T)li 

S„_i(l),...,S„_i(n) 

noise deviations, see (H44) 

j 

the index of the minimal S n -i(j) 

31,32 

the indices of the two smallest S n - 2 {j i), *S'n- 2 (j 2 ) 

(J 

the deviation of noise in (l23l) 


Table 1: Basic experiment: list of input variables 


which is equal to the latter with probability less than one). Obviously, none of 
the schemes cannot use either x tn ie,i or Zi (e.g. both the original message and 
the noise are assumed to be unknown). In other words, each scheme is allowed 
to access only y 0 b s ,i as well as the variables from Tabic [T| 

5.1.4 Stage 4: decoding 

On this stage, we decode each of the messages y 0 bs.i, ■ • • ,Uobs,L by means of 
several schemes from Section [2 More specifically, we use the following schemes: 

1. Brute force (see Section fOl) . 

2. Nearest neighbors search in X (see Section l4~5l) . 

3. Randomized Decoder 1 (see Section I4~51) . 

4. Randomized Decoder 2 (see Section EH). 

Optionally, we improve the results RaDel (or RaDe2) by supercharging (see 
Section 14.81) . 

In addition to y 0 b Si and the variables from Table [lj RaDel and RaDe2 also 
receive several ’’tuning” parameters listed in Table [5] below (see Sections 14.6.41 
14.7.41 for details). 

For every i = 1,..., L, we proceed as follows: 

1. Evaluate Xbesty from y 0 b s ,i by brute force (see Section 14~51) . 

Comment. This is the ’’correct answer” (see, however, Remark |T| in Sec¬ 
tion 14.31) . 

2. Evaluate i 0 bs,i from y 0 bs,i via m- 

3. Evaluate x nn ^),i from y 0 bs,i via (1431) in Section I4~5l bv using k nearest 
neighbors. 

4. Evaluate x’R a Dei,i from y 0 b B ,i via RaDel search (see Section 14.6.41) . 
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Parameter 

Details 

Xthresh 

min_RaDel 
max RaDel 

the threshold confidence fsee Section 14. 6. 3114. 7. 3D 
minimal number of iterations of RaDel fsee Section 14. 6. 4D 
maximal number of iterations of RaDel fsee Section 14.6. 4D 

Xstop 

min_RaDe2 
max RaDe2 

the earlv exit confidence (see (1801) in Section 14. 7. 3D 
minimal number of iterations of RaDe2 fsee Section 14. 7. 4D 
maximal number of iterations of RaDe2 fsee Section 14.7. 4D 

isuper 

ki 

whether to perform supercharging (1) or not (0) 

the number of nearest neighbors in supercharging (see (199D). 


Table 2: Basic experiment: list of ’’tuning” parameters 


5. Evaluate XR a De 2 ,i from y 0 bs,i via RaDe2 search (see Section 14. 7. 411 . 

6. If isuper = 1, evaluate x supel ^ from the best of XRaDei.i,£RaDe 2 ,i by 
supercharging (see (11031) . (11041) below, and also Section I4~51) . 

In other words, for every i = 1,..., L, we execute 

brute(y 0 bs,i;^best,z) 
nnx(y 0 bs,i; k', x 0 bs,i, ^nn(fc),i) 

RclD6l_clll (^/obs,*? min_RaDel, maX_RaDGl, Xthresh? *£RaDel, 2 ? ^RaDel,i} XRaDel,i) 

RaDe2_all (2/obs,i5 min_Ra]I)G2, maX_RalI)G2, Xthresh? Xstop? •£RaDe2,i) ^RaDe 2,2 7 XRaDe2,i) 

( 102 ) 

(see (l33l) . (l44l) . (l65l) . (1891) '). In addition, if isuper = 1, we define Xi in X and the 
real numbers , Xi via the formula 

/ \ I (-TRaDel,* ? r'n.aDel,*; ,\RaDel,z) if rRaDelz Cf^ a D e 2 i, . no , 

{xi,n,Xi) = < , s ,, ( 103 ) 

( (ZRaDe2,i, rR aD e2,i, ,\RaDe2,i) Otherwise 

and execute 


super {Vobs^h ,Xi,n,x ii *Tsuper,0 isuper. / - Xsuper,i) (104) 

(see HMD)- 

5.1.5 Stage 5: evaluation of statistics 

In Section 15.1.41 above, we describe the decoding of a single received message 
by means of several schemes. In this section, we describe a way to compare the 
performance of these schemes via the evaluation of various statistics (essentially, 
via Monte Carlo simulations). Briefly speaking, we evaluate how much time, 
on average, it took for each algorithm to compute its output, and in what 
proportion of cases the output was correct. 

Each statistic is evaluated for every i = 1,..., L. In addition, we also average 
over several matrices H (corresponding to the same set of parameters). 

5.2 Experiment 1 

Description. In this experiment, we proceed as follows. For each n = 6,7, 8, 
we generate five n by n complex matrices H (see Section 15. 1.21 above!. For each 
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such matrix and each a = 0.25, 0.5, 0.75,1,1.25, we generate L = 1000 messages 
Xtrue in X, and for each such x t rue we generate y 0 b s as described in Section l5.1.3l 
For each such y 0 bs, we evaluate the maximal likelihood estimate Zbest of a^true 
via the brute force algorithm from Section 14.31 (see also Section 15.1.41) . 

Thus, for each n = 6, 7,8 and each a = 0.25, 0.5, 0.75,1,1.25, we obtain 5,000 
pairs (ztrue, I’best)- We evaluate the proportion of cases when ztrue is equal to 
a^best- The results of this experiment are displayed in Table [3] 



a 

n 

0.25 

0.5 

0.75 

1 

1.25 

6 

0.99840E+00 

0.83740E+00 

0.44200E+00 

0.17900E+00 

0.74800E-01 

7 

0.99940E+00 

0.87960E+00 

0.49780E+00 

0.20440E+00 

0.77200E-01 

8 

0.10000E+01 

0.90900E+00 

0.45600E+00 

0.15220E+00 

0.49000E-01 


Table 3: Average proportion of Xtme = %best, five channel matrices H per each 
n. The number of messages is L = 1,000 (per matrix). See Section HOI 

Observations. Several observations can be made from Tabled 

1. For each n, the average proportion of a^best = £in decreases monotonically 
with <7, as expected (the larger a is the more likely Xbest is to be different from 
cctrue)- For example, for a = 0.25 this proportion is above 0.99 for all n = 6,7, 8, 
while for a = 1.25 it might be as low as 0.05 (for n = 8). See also Remark]]] in 
Section liTTl 

2. In the view of the previous observation, for each n the values of a vary 
from that corresponding to a relatively ’’easy” decoding task (a = 0.25) to that 
corresponding to a ’’difficult but possible” decoding task (a = 1.25), in the sense 
of Section 14.11 

Motivated by Observation 2, we will select the same values of a in all numer¬ 
ical experiments below, to investigate how the performance of various numerical 
schemes for the solution of the decoding task depends on the standard deviation 
of noise and dimensionality of the matrix. 

5.3 Experiment 2 

In this experiment, we investigate the performance of the nearest neighbors 
search in X (see Section H31) . 

Description. For each n = 6,7,8 and each a = 0.25,0.5,0.75,1,1.25, we 
proceed as in Experiment 1 from Section 15.21 Then, for each received message 
y 0 b s and each k = l,2n + l,2n 2 + l,n 3 ,n 4 ,n 5 + 1, we evaluate the estimate 
Xnn(k) of ^true via the algorithm from Section 03] (see (1441) 1. Then, among all 
cases when Strue = ®best, we evaluate the proportion of cases when the estimate 
x nn (k) is correct; in other words, we compute the number prop nn (n, a , k) defined 
via the formula 


PrOPnnOb °5 fc ) 


ff {-^best — *£true — ^nn(fc) } 
ff {*Ebest -Ttrue} 


(105) 


As in Experiment 1 from Section 15.21 to evaluate each such proportion we use 
5,000 messages (five matrices per n, L = 1,000 messages per matrix). 
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Remark 3. According to (I105D . in this experiment we evaluate the performance 
of the scheme only on those messages Xtme for which the maximal likelihood 
estimate Xbest coincides with Xtrue- 

Tables. The results of Experiment 2 are displayed in Tables 0] [S] These 
tables correspond, respectively, to n = 6, 7,8, and contain prop nn (n, a, k) defined 
via (11H5D above (in each table, rows correspond to the values of cr and columns 
correspond to the values of k). 

In addition, for each n and k, we measure the CPU time required by this 
scheme to evaluate x nn ^) for L = 1,000 messages (obviously, this CPU time 
essentially does not depend on cr: see Section l4~5ll . These CPU times are listed in 
the last five columns of Table[7] Each row of this table corresponds to n = 6, 7, 8, 
while the last five columns of this table correspond to k = 2n, 2n 2 , n 3 , n 4 , n 5 , 
respectively. 



k 

a 

1 

2n + 1 

2 n z + 1 

n 3 

n 4 

n b + 1 

0.25 

0.659E+00 

0.825E+00 

0.906E+00 

0.939E+00 

0.973E+00 

0.991E+00 

0.5 

0.242E+00 

0.458E+00 

0.645E+00 

0.741E+00 

0.852E+00 

0.922E+00 

0.75 

0.108E+00 

0.281E+00 

0.447E+00 

0.559E+00 

0.717E+00 

0.862E+00 

1 

0.693E-01 

0.194E+00 

0.362E+00 

0.468E+00 

0.660E+00 

0.812E+00 

1.25 

0.374E-01 

0.144E+00 

0.286E+00 

0.385E+00 

0.596E+00 

0.805E+00 


Table 4: Success rate of nearest neighbors search in X (see (11051) in Section [57S\ ) . 
Corresponds to Experiment 2 with n = 6. 



k 

a 

1 

2n + 1 

2 rU + 1 

n 3 

n 4 

n b + 1 

0.25 

0.603E+00 

0.794E+00 

0.897E+00 

0.926E+00 

0.971E+00 

0.993E+00 

0.5 

0.226E+00 

0.418E+00 

0.568E+00 

0.658E+00 

0.808E+00 

0.917E+00 

0.75 

0.996E-01 

0.236E+00 

0.374E+00 

0.483E+00 

0.648E+00 

0.796E+00 

1 

0.489E-01 

0.139E+00 

0.262E+00 

0.359E+00 

0.524E+00 

0.679E+00 

1.25 

0.311E-01 

0.907E-01 

0.184E+00 

0.251E+00 

0.425E+00 

0.598E+00 


Table 5: Success rate of nearest neighbors search in X (see (11051) in Section [57S\ ) . 
Corresponds to Experiment 2 with n = 7. 

Observations. We make the following observations from Tables [4][6] 

1. As expected, for each n and a the performance of the scheme improves as 
k increases (obviously, this improvement comes at the cost of additional CPU 
time). For example, for n = 8 and cr = 0.75 the proportion of correct ’’guesses” 
grows from about 2% to about 58% as k increases from 1 to n 5 . 

2. As expected, for each n and k the performance of the scheme deteriorates 
as a increases (noise of a larger standard deviation makes the decoding task 
more difficult). 

3. Typically, for each cr and comparable values of k, the performance of the 
scheme deteriorates as n increases. 
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k 

a 

1 

2n + 1 

2 n A + 1 

rr 

n 4 

n b + 1 

0.25 

0.430E+00 

0.569E+00 

0.655E+00 

0.699E+00 

0.765E+00 

0.821E+00 

0.5 

0.112E+00 

0.244E+00 

0.369E+00 

0.450E+00 

0.566E+00 

0.666E+00 

0.75 

0.298E-01 

0.101E+00 

0.197E+00 

0.287E+00 

0.442E+00 

0.575E+00 

1 

0.920E-02 

0.420E-01 

0.125E+00 

0.192E+00 

0.311E+00 

0.466E+00 

1.25 

0.816E-02 

0.245E-01 

0.571E-01 

0.106E+00 

0.196E+00 

0.363E+00 


Table 6: Success rate of nearest neighbors search in X (see (11051) in Section, ROD . 
Corresponds to Experiment 2 with n = 8. 





k 

n 

RaDel 

RaDe2 

2 n 

2 n z 

n 3 

n 4 

n 5 

6 

0.46E-1 

0.14E+0 

0.20E-1 

0.75E-1 

0.22E+0 

0.13E+1 

0.76E+1 

7 

0.50E-1 

0.12E+0 

0.26E-1 

0.14E+0 

0.40E+0 

0.27E+1 

0.19E+2 

8 

0.54E-1 

0.15E+0 

0.33E-1 

0.27E+0 

0.80E+0 

0.56E+1 

0.45E+2 


Table 7: CPU time of various decoding schemes (in seconds). The number of 
messages is L = 1000. For RaDel and RaDe2, we used a = 0.75. 


4. When n = 8 and a = 1,1.25, the scheme returns the correct answer in 
less than half of all cases for all values of k in Table ED In other words, when, 
for example, a = 1, in more than half of all cases a;t rU e is not among as many as 
32,000 nearest neighbors of i7 _1 • y Q b s in X , even when cctrue = £best (see CUD 
in Section POD . 

Some additional observations can be made from Table [7] 

5. Naturally, the CPU time of nearest neighbor search in A' scales roughly 
as one would expect from (l46l) in Section [4751 

6. Typically, even a noticeable increase in the CPU time (e.g. taking a 
larger k ) results in a fairly modest improvement in performance. For example, 
when n = 8 and <j = 1, using k = 8 4 nearest neighbors to decode the message 
results in about 31% of correct guesses (and it takes about 6 seconds per 1000 
messages). Increasing the number of nearest neighbors by a factor of eight allows 
one to determine about half as many additional messages correctly (e.g. about 
46% overall), while the CPU time goes up to about 45 seconds. 


5.4 Experiment 3 

In this experiment, we investigate the performance of Randomized Decoder 1 
(see Section l4~6l) . 

Description. For each n = 6,7,8 and each a = 0.25,0.5,0.75,1,1.25, we 
proceed as in Experiment 1 (see Section l5T2ll . Then, for each received message 
y 0 bs and each T = 1,..., 7, we evaluate the estimate £R, a Dei(F) of cctrue from 
y 0 bs via RaDel (see Section ROH) . with the following parameters (see (RSI) in 
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Section 14.6.41 and also Table [5] in Section [5.1.3D : 


min_RaDel = T, 
max_RaDel = T , 

Xthresh : not applicable (106) 

(the parameter Xthresh is not applicable since min_RaDel = max_RaDel = T, 
e.g. the number of basic iterations is always exactly T). Then, we define the 
integer k\ via the formula 


ki = 2 ■ n 2 . (107) 

and evaluate the estimate x supel (T ) of Xtrue from y Q b s via RaDel (where the 
parameters are defined via (I106D 1 followed by supercharging (see Section 14751 in 
particular (1551) ): during the supercharging step, k\ nearest neighbors are used. 
In other words, XR a Dei(F) and x supe r(T) are obtained from y Q b s via calling 

RaDel .search ( y ohs ; XR aD ei(P), r, \) 

RaDel_search (y a b s ; x, r, x) 

Super (^/obsj kl: X, 7’, X i *^super(7^), 7" SU per: Xsuper) (108) 

(see (1551) . (1551) 1. where the parameters are defined via (I106D and (11071) . 

Thus, for each n and a, we have 5,000 messages j/ 0 bs (five matrices per 
n, L = 1,000 messages per matrix, as in Experiments 1,2 from Sections 15.21 
EM- for each such message, we obtain 14 estimates XR a Dei(l), ■ • •, XR a Dei(7) 
and x sup er(1),..., x sup er (7) of Xtrue- For each such estimate, we evaluate the 
proportion of the cases in which this estimate is correct (provided that Xtrue = 
Xbest, see Section [5.2D . In other words, we compute prop RaDel (n, cr, T) and 
prop super (n, <r, T) via the formulae 


PrOPRaDelK 0 ! T ) 


{xbest — Xtrue — •ERa.DeRT)} 
4 ^ {Xbest — Xtrue} 


(109) 


and 


prOPs U per (tX, (7, T) 


7^ {xbest — Xt rue — *^super(T) 
i if 1 {-^best — *^true} 


( 110 ) 


respectively (see also Remark [3] in Section EM - 

Tables. The results of Experiment 3 are displayed in Tabic [5] In this table, 
the rows correspond to all pairs of n = 6, 7,8 and T = 1,..., 7, and the columns 
correspond to a = 0.25,0.5,0.75,1,1.25. For each n, cr, T, the table contains 
two entries that appear one under the other: prop RaDel (77., cr, T) (above) and 
prop supe r(7T',T) (below); see (1109D . (1110D . For example, for n = 6, T = 1 and 
a = 0.5, these proportions are equal, respectively, to 0.211 and 0.411. 

In addition, for each n = 6, 7, 8 and for cr = 0.75, we measure the CPU time 
required to process L = 1, 000 messages by a single iteration of this algorithm. 
These CPU times are listed in the second column of Table [7] 

Observations. The following observations can be made from Table [5] and 
some additional experiments by the authors. 
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<7 

n 

min RaDel 

0.25 

0.5 

0.75 

1 

1.25 

6 

1 

0.587E+00 

0.749E+00 

0.211E+00 

0.411E+00 

0.103E+00 

0.268E+00 

0.737E-01 

0.219E+00 

0.588E-01 

0.168E+00 

6 

2 

0.737E+00 

0.843E+00 

0.342E+00 

0.549E+00 

0.185E+00 

0.381E+00 

0.128E+00 

0.316E+00 

0.802E-01 

0.222E+00 

6 

3 

0.791E+00 

0.879E+00 

0.427E+00 

0.620E+00 

0.267E+00 

0.465E+00 

0.207E+00 

0.404E+00 

0.142E+00 

0.337E+00 

6 

4 

0.836E+00 

0.904E+00 

0.478E+00 

0.667E+00 

0.297E+00 

0.502E+00 

0.212E+00 

0.426E+00 

0.160E+00 

0.353E+00 

6 

5 

0.858E+00 

0.922E+00 

0.509E+00 

0.688E+00 

0.343E+00 

0.545E+00 

0.258E+00 

0.472E+00 

0.198E+00 

0.422E+00 

6 

6 

0.878E+00 

0.932E+00 

0.554E+00 

0.719E+00 

0.355E+00 

0.558E+00 

0.296E+00 

0.473E+00 

0.222E+00 

0.409E+00 

6 

7 

0.892E+00 

0.940E+00 

0.582E+00 

0.735E+00 

0.374E+00 

0.574E+00 

0.317E+00 

0.494E+00 

0.225E+00 

0.404E+00 

7 

1 

0.678E+00 

0.839E+00 

0.310E+00 

0.526E+00 

0.155E+00 

0.334E+00 

0.900E-01 

0.227E+00 

0.622E-01 

0.174E+00 

7 

2 

0.793E+00 

0.902E+00 

0.419E+00 

0.631E+00 

0.242E+00 

0.465E+00 

0.155E+00 

0.325E+00 

0.137E+00 

0.267E+00 

7 

3 

0.844E+00 

0.929E+00 

0.487E+00 

0.689E+00 

0.293E+00 

0.514E+00 

0.182E+00 

0.366E+00 

0.137E+00 

0.324E+00 

7 

4 

0.875E+00 

0.942E+00 

0.541E+00 

0.731E+00 

0.348E+00 

0.562E+00 

0.245E+00 

0.444E+00 

0.192E+00 

0.373E+00 

7 

5 

0.903E+00 

0.955E+00 

0.576E+00 

0.756E+00 

0.364E+00 

0.575E+00 

0.261E+00 

0.446E+00 

0.199E+00 

0.376E+00 

7 

6 

0.910E+00 

0.959E+00 

0.600E+00 

0.771E+00 

0.400E+00 

0.605E+00 

0.273E+00 

0.465E+00 

0.210E+00 

0.376E+00 

7 

7 

0.924E+00 

0.964E+00 

0.624E+00 

0.790E+00 

0.426E+00 

0.619E+00 

0.307E+00 

0.513E+00 

0.220E+00 

0.412E+00 

8 

1 

0.757E+00 

0.900E+00 

0.288E+00 

0.514E+00 

0.129E+00 

0.296E+00 

0.631E-01 

0.184E+00 

0.367E-01 

0.147E+00 

8 

2 

0.845E+00 

0.941E+00 

0.393E+00 

0.617E+00 

0.186E+00 

0.376E+00 

0.105E+00 

0.247E+00 

0.653E-01 

0.188E+00 

8 

3 

0.883E+00 

0.959E+00 

0.446E+00 

0.658E+00 

0.233E+00 

0.425E+00 

0.122E+00 

0.263E+00 

0.898E-01 

0.216E+00 

8 

4 

0.902E+00 

0.965E+00 

0.475E+00 

0.683E+00 

0.250E+00 

0.457E+00 

0.154E+00 

0.302E+00 

0.114E+00 

0.257E+00 

8 

5 

0.910E+00 

0.965E+00 

0.508E+00 

0.697E+00 

0.278E+00 

0.475E+00 

0.152E+00 

0.327E+00 

0.102E+00 

0.282E+00 

8 

6 

0.920E+00 

0.968E+00 

0.539E+00 

0.719E+00 

0.290E+00 

0.486E+00 

0.173E+00 

0.357E+00 

0.114E+00 

0.278E+00 

8 

7 

0.931E+00 

0.974E+00 

0.546E+00 

0.729E+00 

0.300E+00 

0.493E+00 

0.184E+00 

0.350E+00 

0.114E+00 

0.294E+00 


Table 8: Success rate of RaDel, with and without supercharging (k\ = 2 n 2 ). 
Corresponds to Experiment 3 in Section GE3 
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1. Not surprisingly, for each n and u the performance of the scheme improves 
as the number of iterations T increases (obviously, this improvement comes at 
the cost of additional CPU time). See, however, Remark H] below. 

2. As expected, for each n, a and T, the performance of the scheme with 
supercharging is better than without supercharging (again, at the cost of addi¬ 
tional CPU time). 

3. The effects of supercharging are generally rather negligible when the 
proportion of ’’correct guesses” is already high. For example, when n = 8, 
a = 0.25 and T = 7, supercharging improves the success rate from 93.1% to 
97.4%. On the other hand, the improvement might be quite noticeable when 
the success rate is relatively low. For example, when n = 8, a = 0.75 and T = 7, 
supercharging improves the success rate from 30% to 49%. 

4. As expected, for each n and T the performance of the scheme deteriorates 
as cr increases (noise of a larger standard deviation makes the decoding task more 
difficult). 

Remark 4. Additional numerical experiments seem to indicate that, for suffi¬ 
ciently large noise, the success rate of RaDel does not approach 100% as the 
number of iterations increases to some reasonable value (e.g. T = 100,), but 
seems to be stuck at some intermediate value (e.g. 70%). This phenomenon 
is likely to be related to the fact that RaDel is not fully randomized, i.e. the 
parameter j from Table [7| (see also Section \4-6.2\ ) is fixed (and is determined 
by H). To get rid of this undesirable feature, in the future we will investigate 
the possibility of choosing j at random, probably taking into account the values 
S n -i(j) (see (l501l in Section \4.6.1\ and also Table [7]). 

Some additional observations can be made from Table [7] 

5. A single iteration of RaDel (see Section 14.6.311 is faster than the nearest 
search in X with k = 2 • n 2 (see Section l4~5l) by a factor of about 1.6 for n = 6 
and by a factor of about 5 for n = 8. Needless to say, for larger values of k the 
difference in CPU times is even more significant. For example, for n = 8 and 
k = n 5 the nearest neighbors search in X is about 800 times slower. 

6. Since the CPU time required by a a nearest neighbors search in X with 
k = 2 • n 2 is of the same order of magnitude as that required by a single iteration 
of RaDel, it might make sense to perform fewer iterations of the basic scheme 
followed by a supercharging rather than performing more iterations of the basic 
scheme without supercharging (see also Table HJ). 


5.5 Experiment 4 

In this experiment, we investigate the performance of Randomized Decoder 2 
(see Section PTTll . 

Description. This experiment is similar to that described in Section 15.41 
with the difference that instead of RaDel (see Section 14.61) we use RaDe2 (see 
Section lOl) . More specifically, for each n = 6, 7, 8 and each a = 0.25, 0.5,0.75,1,1.25, 
we proceed as in Experiment 1 from Section l5.2l Then, for each received message 
7/obs and each T = 1,..., 7, we evaluate the estimate iRaDe 2 (P) of irtrue from 
y 0 bs via RaDe2, with the following parameters (see (1501) . (1501) in Section 14.7.41 
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and also Tabic [2] in Section [5.1.31) : 


min_RaDe2 = T, 
max_RaDe2 = T, 

Xstop = 10 , 

Xthresh : not applicable (111) 

(the parameter Xthresh is not applicable since min_RaDe2 = max_RaDe2 = T, 
e.g. the number of basic iterations is always exactly T). Then, we define 
the integer ki via (I107D above, and evaluate the estimate x supel (T) of Xtrue 
from y 0 bs via RaDe2 (where the parameters are defined via (11111) ) followed by 
supercharging (see Section 14.81 in particular (198( 1: during the supercharging 
step, fci nearest neighbors are used. In other words, x Ra De 2 CF) an d x sup er(F) 
are obtained from y 0 b s via calling 

RaDe2_search (y obs ; x RaDe 2 {T),r, x) 

RaDe2_search (y a b s ; x, r, x) 

super (y obs -ki,x,r,X', •^super CO, ^super ? Xsuper) (H2) 

(see m, «, where the parameters are defined via cnD and HMD- 

Thus, for each n and cr, we have 5,000 messages y Q b s (five matrices per n , 
L = 1,000 messages per matrix, as in Experiments 1,2 from Sections 15.21 15.31 
El]); for each such message, we obtain 14 estimates x Ra De 2 (l), • • •, x Ra De2(7) 
and x sup er(1), •••, x SU per(7) of Xtrue- For each such estimate, we evaluate the 
proportion of the cases in which this estimate is correct (provided that Xt rU e = 
Xbest, see Section [531) . In other words, we compute prop RaDe2 (n, cr, T) and 
prop super (n, cr, T) via the formulae 


prop RaDe 2 O, O 


^ {xbest — Xtrue — X Ra b) e 2(T)} 
^ {Xbest — Xtrue} 


and 


PrOPsuperO,^, O 


^ {xbest — Xtrue — ^super(T) } 
^ {Xbest — Xtrue} 


(113) 


(114) 


respectively (see also Remark [3] in Section HOI) . 

Tables. The results of Experiment 4 are displayed in Table [9j whose struc¬ 
ture is similar to that of Table [5] More specifically, in this table, the rows corre¬ 
spond to all pairs of n = 6, 7, 8 and T = 1,... ,7, and the columns correspond to 
a = 0.25,0.5,0.75,1,1.25. For each n,a,T, the table contains two entries that 
appear one under the other: prop RaDe2 (n., <r, T) (above) and prop sup er (n,a,T) 
(below); see (11131) . (11141) . For example, for n = 6, T = 1 and a = 0.5, these 
proportions are equal, respectively, to 0.533 and 0.775. 

In addition, for each n = 6, 7, 8 and for a = 0.75, we measure the CPU time 
required to process L = 1,000 messages by a single iteration of this algorithm. 
These CPU times are listed in the third column of Table [3 

Observations. The following observations can be made from Table [9] and 
some additional experiments by the authors. These observations are somewhat 
similar to those from Section HTTl 
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a 

n 

min RaDe2 

0.25 

0.5 

0.75 

1 

1.25 

6 

1 

0.969E+00 

0.987E+00 

0.752E+00 

0.902E+00 

0.533E+00 

0.775E+00 

0.429E+00 

0.696E+00 

0.302E+00 

0.612E+00 

6 

2 

0.989E+00 

0.997E+00 

0.871E+00 

0.955E+00 

0.713E+00 

0.870E+00 

0.609E+00 

0.804E+00 

0.529E+00 

0.735E+00 

6 

3 

0.995E+00 

0.998E+00 

0.912E+00 

0.965E+00 

0.798E+00 

0.915E+00 

0.697E+00 

0.849E+00 

0.642E+00 

0.821E+00 

6 

4 

0.996E+00 

0.998E+00 

0.938E+00 

0.978E+00 

0.841E+00 

0.930E+00 

0.771E+00 

0.892E+00 

0.709E+00 

0.866E+00 

6 

5 

0.998E+00 

0.100E+01 

0.952E+00 

0.984E+00 

0.871E+00 

0.948E+00 

0.807E+00 

0.926E+00 

0.781E+00 

0.874E+00 

6 

6 

0.998E+00 

0.999E+00 

0.962E+00 

0.985E+00 

0.892E+00 

0.960E+00 

0.839E+00 

0.930E+00 

0.807E+00 

0.912E+00 

6 

7 

0.998E+00 

0.100E+01 

0.968E+00 

0.988E+00 

0.909E+00 

0.962E+00 

0.860E+00 

0.949E+00 

0.834E+00 

0.914E+00 

7 

1 

0.838E+00 

0.895E+00 

0.536E+00 

0.714E+00 

0.349E+00 

0.562E+00 

0.262E+00 

0.448E+00 

0.202E+00 

0.446E+00 

7 

2 

0.918E+00 

0.948E+00 

0.657E+00 

0.804E+00 

0.488E+00 

0.673E+00 

0.387E+00 

0.578E+00 

0.334E+00 

0.510E+00 

7 

3 

0.954E+00 

0.973E+00 

0.727E+00 

0.854E+00 

0.556E+00 

0.735E+00 

0.440E+00 

0.630E+00 

0.396E+00 

0.570E+00 

7 

4 

0.963E+00 

0.978E+00 

0.784E+00 

0.887E+00 

0.598E+00 

0.757E+00 

0.494E+00 

0.688E+00 

0.415E+00 

0.624E+00 

7 

5 

0.976E+00 

0.985E+00 

0.802E+00 

0.896E+00 

0.648E+00 

0.785E+00 

0.535E+00 

0.690E+00 

0.456E+00 

0.655E+00 

7 

6 

0.981E+00 

0.989E+00 

0.834E+00 

0.913E+00 

0.674E+00 

0.818E+00 

0.564E+00 

0.735E+00 

0.503E+00 

0.699E+00 

7 

7 

0.982E+00 

0.988E+00 

0.859E+00 

0.934E+00 

0.705E+00 

0.837E+00 

0.619E+00 

0.748E+00 

0.516E+00 

0.692E+00 

8 

1 

0.764E+00 

0.863E+00 

0.370E+00 

0.554E+00 

0.229E+00 

0.393E+00 

0.158E+00 

0.281E+00 

0.155E+00 

0.224E+00 

8 

2 

0.888E+00 

0.941E+00 

0.494E+00 

0.699E+00 

0.314E+00 

0.506E+00 

0.242E+00 

0.427E+00 

0.200E+00 

0.347E+00 

8 

3 

0.935E+00 

0.966E+00 

0.586E+00 

0.760E+00 

0.400E+00 

0.588E+00 

0.298E+00 

0.480E+00 

0.249E+00 

0.400E+00 

8 

4 

0.958E+00 

0.980E+00 

0.652E+00 

0.813E+00 

0.420E+00 

0.626E+00 

0.334E+00 

0.523E+00 

0.290E+00 

0.465E+00 

8 

5 

0.970E+00 

0.982E+00 

0.704E+00 

0.841E+00 

0.461E+00 

0.665E+00 

0.363E+00 

0.568E+00 

0.306E+00 

0.527E+00 

8 

6 

0.977E+00 

0.989E+00 

0.732E+00 

0.859E+00 

0.512E+00 

0.697E+00 

0.393E+00 

0.598E+00 

0.302E+00 

0.502E+00 

8 

7 

0.977E+00 

0.992E+00 

0.759E+00 

0.873E+00 

0.545E+00 

0.717E+00 

0.417E+00 

0.622E+00 

0.339E+00 

0.518E+00 


Table 9: Success rate of RaDe2, with and without supercharging (k\ = 2 n 2 ). 
Corresponds to Experiment 4 in Section \5 .51 
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1. Not surprisingly, for each n and a the performance of the scheme improves 
as the number of iterations T increases (obviously, this improvement comes at 
the cost of additional CPU time). See, however, Remark [5] below. 

2. As expected, for each n, a and T, the performance of the scheme with 
supercharging is better than without supercharging (again, at the cost of addi¬ 
tional CPU time). 

3. The effects of supercharging are generally rather negligible when the 
proportion of ’’correct guesses” is already high. For example, when n = 8, 
a = 0.25 and T = 3, supercharging improves the success rate from 93.5% to 
96.6%. On the other hand, the improvement might be quite noticeable when 
the success rate is relatively low. For example, when n = 8, a = 0.75 and T = 2, 
supercharging improves the success rate from 31.4% to 50.6%. 

4. As expected, for each n and T the performance of RaDe2 deteriorates as 
a increases (noise of a larger standard deviation makes the decoding task more 
difficult). 

Remark 5. Additional numerical experiments seem to indicate that, for suffi¬ 
ciently large noise, the success rate of RaDe2 does not approach 100% as the 
number of iterations increases to some reasonable value (e.g. T = 100^1, but 
seems to be stuck at some intermediate value (e.g. 70%). In addition, this 

intermediate value is larger that the related value of RaDel (see Remark [7] in 
Section GEZP- This phenomenon is likely to be related to the fact that RaDe2 
is not fully randomized, i.e. the parameters ji,j 2 from Table [7| (see also Sec- 
tion \4.7.if\ ) are fixed (and are determined by H). To get rid of this undesirable 
feature, in the future we will investigate the possibility of choosing ji,j 2 at ran¬ 
dom, probably taking into account the values <SVi- 2 (ji), Sn- 2 (j 2 ) (see (1501) in 
Section \4.6.1\ and also Table [T]j. 

Some additional observations can be made from Table 0 

5. A single iteration of RaDe2 (see Section 14.7.31) is typically somewhat 
slower than a single iteration of RaDel (by a factor between 2.4 and 3.1 for 
n = 6, 7, 8 and a = 0.75). Obviously, this is expected from (1(771) . (17)11) . 

6. Since the CPU time required by a a nearest neighbors search in X with 
k = 2 • n 2 is of the same order of magnitude as that required by a single iteration 
of RaDe2, it might make sense to perform fewer iterations of the basic scheme 
followed by a supercharging rather than performing more iterations of the basic 
scheme without supercharging (see also Table 0. 

5.6 Experiment 5 

In this experiment, we compare the performance of the three decoding schemes: 
nearest neighbors search in X, RaDel and RaDe2 (see Sections 14.51 14.61 14.71 
respectively). Roughly speaking, the purpose of this experiment is to determine 
empirically how long it takes for each of the schemes to achieve a certain success 
rate (for several choices of n and noise deviation a). In this sense, this experi¬ 
ment can be viewed simply as a comparison of the results of Experiments 2,3,4 
above. For the sake of completeness, however, we provide a detailed description 
below. 

For each n = 6,7,8 and each a = 0.25,0.75,1.25, we proceed as follows. 
First, we repeat Experiment 1 from Section 157^1 thus, for each of the 5,000 ran¬ 
domly selected transmitted messages Xtrue i n X, we obtain a received message 
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y 0 bs in C ra and a maximum likelihood estimate Xbest of Xtrue (see Section lS^l) . As 
in Experiments 2,3,4 above, to evaluate the performance of a decoding scheme 
we use only those messages for which Xbest = Xtrue (see Remark [5] in Section [53]). 

Then we choose, more or less arbitrarily, the desired success rate 0 < p < 1, 
and, for each of the schemes from Sections l4.5l 14.6114.71 we choose the parameters 
in such a way that the success rate of the scheme is roughly equal to p (see e.g. 
(I105D in Section 1531 (11091) . (11101) in Section PT~T1 (11131) . (II141) in Section 10)1) . 
More specifically: 

for the nearest neighbors search in A', we select the appropriate number of 
nearest neighbors k (see Section POll : 

for RaDel, we select the number of iterations T, the number of nearest 
neighbors k\ for supercharging (zero if no supercharging; see also HMD), and 
use the parameters from (I106D : 

for RaDe2, select the number of iterations T, the number of nearest neighbors 
ki for supercharging (zero if no supercharging; see also and use the 

parameters from m- 

For each of the schemes from Sections piiTOrm we measure the propor¬ 
tion of correct guesses (prop nn (n, cr), prop RaDel (n, cr), prop RaDe2 (n, cr), respec¬ 
tively; see (11051) . (11091) . (11101) . (11131) . (11141) ). In addition, for each scheme, we 
measure the CPU times required to decode 1,000 messages (see e.g. Table 0. 

Tables. The results of this experiment are displayed in Tables [TO] [18] 
Each of this tables corresponds to a certain choice of n and cr, and has the 
following structure. The columns correspond to the decoding schemes from 
Sections 14.5114.6114.71 respectively (i.e. nearest neighbors search in A, RaDel 
and RaDe2). In the first row, we list the scheme’s parameters (k for the nearest 
neighbors search in A, number of iterations T and supercharging parameter 
k\ for each of RaDel and RaDe2). The second row contains the proportion 
of correctly decoded messages (provided that Xtrue = Xbest)> he. prop nn (n, cr), 
prop RaDel (n, cr), prop RaDe2 (n, a) , respectively (by design, we expect these pro¬ 
portions to be roughly the same across all columns). The third row contains 
the CPU time (in seconds) required to decode 1,000 messages; these times are 
used to compare the performance of the schemes (the faster the better). 

For example, for n = 6 and cr = 0.25 (see Table [Toll . it takes the scheme 
from Section 14.51 (with k = 2n 2 + 1) to achieve the success rate of 91% in 
0.075 seconds per 1,000 messages; on the other hand, one iteration of RaDe2 
(without supercharging) achieves the success rate of 97% in 0.052 seconds per 
1,000 messages. 



NN in A 

RaDel 

RaDe2 

parameters 

k = 2t i z + 1 

4 iterations, k\ = 2 n 

1 iteration, k\ = 0 

proportion 

0.91E+0 

0.90E+0 

0.97E+0 

CPU time 

0.75E-1 

0.20E+0 

0.52E-1 


Table 10: Corresponds to Experiment 5 with n = 6, cr = 0.25 (see Section [57B\ ) . 

Observations. The following observations can be made from TablesQIj] [lH] 
and some additional experiments by the authors. In all of these observations, 
by ’’better performance” we mean ’’takes less CPU time to achieve a similar 
success rate”. All the times are in seconds per 1,000 messages. 
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NN in X 

RaDel 

RaDe2 

parameters 

k = 2 ri z + 1 

2 iterations, k\ = 2 n 

2 iterations, k\ = 0 

proportion 

0.90E+0 

0.90E+0 

0.92E+0 

CPU time 

0.14E+0 

0.11E+0 

0.10E+0 


Table 11: Corresponds to Experiment 5 with n = 7, cr = 0.25 (see Section \5.6\) . 



NN in X 

RaDel 

RaDe2 

parameters 

k = n b + 1 

2 iterations, k\ = 0 

2 iterations, k\ = 0 

proportion 

0.82E+0 

0.85E+0 

0.88E+0 

CPU time 

0.45E+2 

0.10E+0 

0.13E+0 


Table 12: Corresponds to Experiment 5 with n = 8, a = 0.25 (see Section ] 5. 60 . 



NN in X 

RaDel 

RaDe2 

parameters 

£ 

II 

20 iterations, k\ = 2 n 2 

3 iterations, ki = 2 ri 2 

proportion 

0.72E+0 

0.70E+0 

0.71E+0 

CPU time 

0.13E+1 

0.10E+1 

0.27E+0 


Table 13: Corresponds to Experiment 5 with n = 6, a = 0.75 (see Section roi ). 



NN in X 

RaDel 

RaDe2 

parameters 

II 

4- 

9 iterations, k\ = 2 ri 2 

2 iterations, k\ = 2 ri 2 

proportion 

0.65E+0 

0.65E+0 

0.67E+0 

CPU time 

0.27E+1 

0.58E+0 

0.35E+0 


Table 14: Corresponds to Experiment 5 with n = 7, a = 0.75 (see Section \5.6\) . 



NN in X 

RaDel 

RaDe2 

parameters 

k = n b + 1 

20 iterations, k\ = 2 n z 

3 iterations, k\ = 0 

proportion 

0.58E+0 

0.57E+0 

0.59E+0 

CPU time 

0.45E+2 

0.13E+1 

0.61E+0 


Table 15: Corresponds to Experiment 5 with n = 8, <r = 0.75 (see Section [57U\) . 



NN in X 

RaDel 

RaDe2 

parameters 

-t 

£ 

II 

25 iterations, k\ = 2 ri 1 

1 iteration, k\ = 2 n 2 

proportion 

0.60E+0 

0.59E+0 

0.61E+0 

CPU time 

0.13E+1 

0.12E+1 

0.30E+0 


Table 16: Corresponds to Experiment 5 with n = 6, a = 1.25 (see Sec.tion YhlX) . 
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NN in X 

RaDel 

RaDe2 

parameters 

k = n b + 1 

50 iterations, k\ = 2 n 2 

4 iterations, k\ = 2 n 2 

proportion 

0.60E+0 

0.59E+0 

0.63E+0 

CPU time 

0.19E+2 

0.26E+1 

0.88E+0 

Table 17: Corresponds to Experiment 5 with n = 7, cr 

= 1.25 (see SectionUrtH) 


NN in X 

RaDel 

RaDe2 

parameters 

k = n b + 1 

20 iterations, k\ = 2 n 2 

3 iterations, k\ = 2 n 2 

proportion 

0.36E+0 

0.36E+0 

0.38E+0 

CPU time 

0.45E+2 

0.13E+1 

0.82E+0 


Table 18: Corresponds to Experiment 5 with n = 8, a = 1.25 (see Section \5J)\) . 


1. When the noise is relatively small (cr = 0.25), the performance of RaDel 
is similar to that of RaDe2 (moreover, even a single iteration of each scheme 
takes roughly the same time; compare to Table [7] that corresponds to cr = 0.75, 
and see also (EZI). (ED). For example, for n = 8, it takes RaDel 0.1 seconds to 
achieve the success rate of 85%, while it takes RaDe2 0.13 seconds to achieve 
the success rate of 88% (see Table ED. 

2. Even for cr = 0.25, the performance of the nearest neighbors search in X 
strongly depends on n (for similar success rates). For example, when n = 6 or 
n = 7, this scheme achieves the success rate of about 90% in time comparable 
to that of RaDel and RaDe2 (see Tables flOl ITT! in these, k = 2n 2 + 1). On 
the other hand, when n = 8, the nearest neighbors search in X requires already 
k = n 5 to achieve the success rate of 82%; subsequently, it is slower than both 
RaDel and RaDe2 schemes by a factor of about 40 (see Table ED. 

3. When u = 0.75, RaDe2 outperforms RaDel somewhat. More specifically, 
it achieves a slightly higher success rate (71% vs. 70n = 8) faster (by a factor 
of 3.7 for n = 6, by a factor of 1.7 for n = 7, and by a factor of 2.1 for n = 8). 
In this case, while a single iteration of RaDe2 is slower than a single iteration 
of RaDel (see Table Q, the latter requires more iterations to achieve a similar 
success rate (see Tables El ED- 

4. When cr = 0.75, the nearest neighbors search in A' is noticeably slower 
than RaDe2 (when the success rates are about 70%, 65% and 60% for n = 6, 7, 8, 
respectively). More specifically, it is slower by factors of 4.8, 7.7 and 74 for 
n = 6, 7,8, respectively (see Tables El ED- 

5. When the noise is large (cr = 1.25), both RaDel and RaDe2 typically 
perform somewhat better with supercharging than without it. In addition, in 
all experiments, RaDe2 outperforms RaDel (by factors of about 4,3 and 1.5 
with success rates of about 60%, 60% and 37% for n = 6,7, 8, respectively). 

6. For cr = 1.25, the algorithm from Section H75l requires a relatively large 
number k of nearest neighbors to achieve a success rate similar to that of RaDel 
and RaDe2, in all experiments (see Tables El ED- Consequently, the nearest 
neighbors search in X is slower than, say, RaDe2, by a factor of 4.3, 22 and 55 
with success rates about 60%, 60% and 37% for n = 6, 7,8, respectively. 

We conclude that the nearest neighbors search in X typically underperforms 
compared to both RaDel and RaDe2 decoding schemes (i.e. takes more time to 
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achieve a similar success rate); moreover, for n = 8 the difference can be quite 
noticeable. On the other hand, RaDe2 typically outperforms RaDel. 


6 Conclusions and Future Research 

In this paper, we presented several schemes for decoding of digital messages sent 
over a noisy multivariate Gaussian channel, and illustrated their performance 
via numerical experiments. 

Needless to say, there is always a gap between a prototype dealing with a 
single mathematical model and the multitude of applications; we are looking 
forward to collaborating with researchers and engineers in both industry and 
academia to shrink this gap in the case of our decoding schemes. 

Some future directions of our research in this area are related to the fact that 
our schemes need to be extensively tested in a demanding industrial environment 
on various choices of C,H,a, etc.; this will lead to further improvements and 
modifications. Other developments are related to the fact that the decoding 
problem described above admits certain variations (frequently encountered in 
applications). For example, often the coordinates of the transmitted message x 
are not independent of each other (e.g. there is a certain redundancy, as is in 
case of parity checks etc.). In other words, the set of possible message is not all 
of X but rather a subset of X (the reader is referred, for example, to [2], [5], 
m, m, Ha, m, m, for further information on the subject). In addition, 
certain messages might be more likely to be transmitted than others (based, 
for instance, on long-term statistics). Obviously, our schemes will need to be 
modified to take all such additional information into account. 
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